Question

If you run summary() command on a result of linear model fitting returned by lm(), you...

If you run summary() command on a result of linear model fitting returned by lm(), you will see a column Pr(>t). You could guess that it is a result of some t-test. How does it apply here? So far we have seen t-tests in the settings of testing sample locations. The ingredients were: 1) null hypothesis: in earlier cases we looked into the null that stated that two samples came from the distribution(s) with the same mean(s); 2) test statistic: for most examples so far we chose to look at the difference between the means of the two samples - maybe normalized by the (sample) standard deviation, if we are talking about t-test; 3) some calculation (either brute force resampling or analytical t-distribution) that told us whether the observed difference between the means is likely to be observed merely by random chance. That latter probability for the test statistic to reach the observed, or even larger, value under the null hypothesis is the p-value, which directly described the significance of the finding. In connection with the linear model y=ax+b+e (e is the random term), our findings that we want to characterize statistically are coefficients a,b (we just compute them straightforwardly from the data). Describe qualitatively: (1) why do we need statistical description (significance and all) in the first place? We got data, they do not lie exactly on a straight line, but we calculated least square line and that's the best fit we can get. Where is the randomness against which we want to calculate significance of our coefficients? (2) significance involves the null hypothesis. What do you think would be the null hypothesis in this case? It may help you to think what finding would be insignificant - what is your common sense telling you? (3) Now that we understand the source of randomness and formulated the null hypothesis, how would we test it? Again, I am looking for qualitative understanding only; if you think you see how an analytical test (e.g. t-test) could be applied, that's great; if you can describe (very briefly, a few sentences) the brute force sampling procedure that you would use to calculate the significance of the regression coefficient, e.g. slope, it's fine too. Again, I am not looking for math, but for the discussion of why linear regression is just another statistical problem, to which all the usual statistical ways of thinking should apply (randomness, underlying distributions vs estimators, significance levels, hypothesis testing).

Homework Answers

Know the answer?
Your Answer:

Post as a guest

Your Name:

What's your source?

Earn Coins

Coins can be redeemed for fabulous gifts.

Not the answer you're looking for?
Ask your own homework help question
Similar Questions
You are testing the null hypothesis that there is no linear relationship between two variables, X...
You are testing the null hypothesis that there is no linear relationship between two variables, X and Y. You are given the following regression results, where the sample size is 10., Coefficient Standart error Intercept -1.2 1 X 2 2 a) What is the value of the t test statistic? b) At the α = 0.05 level of significance, what are the critical values? c) Based on your answer to (a) and (b), what statistical decision should you make ?
Use the following linear regression equation to answer the questions. x1 = 1.1 + 3.0x2 –...
Use the following linear regression equation to answer the questions. x1 = 1.1 + 3.0x2 – 8.4x3 + 2.3x4 (a) Which variable is the response variable? x3 x1      x2 x4 Which variables are the explanatory variables? (Select all that apply.) x1 x2 x3 x4 (b) Which number is the constant term? List the coefficients with their corresponding explanatory variables. constant = x2 coefficient= x3 coefficient= x4 coefficient= (c) If x2 = 4, x3 = 10, and x4 = 6, what...
Use the following linear regression equation to answer the questions. x3 = ?16.6 + 3.7x1 +...
Use the following linear regression equation to answer the questions. x3 = ?16.6 + 3.7x1 + 8.6x4 ? 1.9x7 If x4 decreased by 2 units, what would we expect for the corresponding change in x3? (e) Suppose that n = 20 data points were used to construct the given regression equation and that the standard error for the coefficient of x4 is 0.924. Construct a 90% confidence interval for the coefficient of x4. (Round your answers to two decimal places.)...
A researcher analyzes the factors that may influence amusement park attendance. She estimates the following model:...
A researcher analyzes the factors that may influence amusement park attendance. She estimates the following model: Attendance = ?0 + ?1Price + ?2Temperature + ?3Rides + ?, where Attendance is the daily attendance (in 1,000s), Price is the gate price (in $), Temperature is the average daily temperature (in °F), and Rides is the number of rides at the amusement park. A portion of the regression results is shown in the accompanying table.    df SS MS F Significance F...
You are testing the null hypothesis that there is no linear relationship between two​ variables, X...
You are testing the null hypothesis that there is no linear relationship between two​ variables, X and Y. From your sample of n =18, you determine that b1=5.1 and Sb1 =1.1 a. What is the value of tSTAT​? b. At the α=0.05 level of​ significance, what are the critical​ values? c. Based on your answers to​ (a) and​ (b), what statistical decision should you​ make? d. Construct a​ 95% confidence interval estimate of the population​ slope, β1. I would greatly...
1. The P-value of a test of the null hypothesis is a. the probability the null...
1. The P-value of a test of the null hypothesis is a. the probability the null hypothesis is true. b. the probability the null hypothesis is false. c. the probability, assuming the null hypothesis is false, that the test statistic will take a value at least as extreme as that actually observed. d. the probability, assuming the null hypothesis is true, that the test statistic will take a value at least as extreme as that actually observed. 2. The P-value...
Statistical significance (alpha level; p-value) reflects the odds that a particular finding could have occurred by...
Statistical significance (alpha level; p-value) reflects the odds that a particular finding could have occurred by chance. If the p-value for a difference between two groups is 0.05, it would be expected to occur by chance just 5 times out of 100 (thus, it is likely to be a “real” difference). We define a p-value of less than 0.05 to indicate statistical significance. Suppose a pizza place claims their delivery times are 30 minutes or less on average but you...
Use the following linear regression equation to answer the questions. x1 = 1.5 + 3.5x2 –...
Use the following linear regression equation to answer the questions. x1 = 1.5 + 3.5x2 – 8.2x3 + 2.1x4 (a) Which variable is the response variable? A. x3 B. x1     C. x2 D. x4 (b) Which variables are the explanatory variables? (Select all that apply.) A. x4 B. x1 C. x3 D. x2 (c) Which number is the constant term? List the coefficients with their corresponding explanatory variables. constant ____________ x2 coefficient_________ x3 coefficient_________ x4 coefficient_________ (d) If x2 =...
In another study if the alternative hypothesis is µ1 < µ2 and you conclude in favor...
In another study if the alternative hypothesis is µ1 < µ2 and you conclude in favor of the alternative hypothesis in words what you conclude? The above alternative hypoethesis would be Nondirection al hypothesis Directional hypothesis Generally, the larger the samples, the smaller the standard errors become, increasing the likihood of finding statistical signficance. This statement is Calculating the standard error of the difference between means depends on the standard deviations as well as the sample sizes between the groups,...
T F 1. A p-value of .008 in hypothesis testing means there is only a .8%...
T F 1. A p-value of .008 in hypothesis testing means there is only a .8% chance we could get such sample statistics from the population if the null hypothesis is as stated. Such an event is considered unlikely and we would reject the null hypothesis. T F 2. As a general rule in hypothesis testing, it is always safer to set up your alternate hypothesis with a greater-than or less-than orientation. _____3. If the level of significance is .02...