Car manufacturers produced a variety of classic cars that continue to increase in value. Suppose the following data is based upon the Martin Rating System for Collectible Cars, and shows the rarity rating (1–20) and the high price ($1,000) for 15 classic cars.
Model | Rating | Price ($1,000) |
---|---|---|
A | 19 | 2,650.0 |
B | 13 | 45.0 |
C | 14 | 62.0 |
D | 18 | 1,025.0 |
E | 18 | 375.0 |
F | 15 | 102.5 |
G | 19 | 1,325.0 |
H | 18 | 1,575.0 |
I | 17 | 475.0 |
J | 16 | 250.0 |
K | 17 | 375.0 |
L | 19 | 4,000.0 |
M | 16 | 125.0 |
N | 16 | 375.0 |
O | 17 | 165.0 |
(a) Develop a scatter diagram of the data using the rarity rating as the independent variable and price as the independent variable.
Does a simple linear regression model appear to be appropriate?
No, there doesn't appear to be a relationship between the two variables.
No, there appears to be a curvilinear relationship between the two variables.
Yes, there appears to be a linear relationship between the two variables.
(b)Develop an estimated multiple regression equation with x = rarity rating and x2 as the two independent variables. (Round b0 and b1 to the nearest integer and b2 to one decimal place.)
ŷ =
(c) Consider the nonlinear relationship shown by equation (16.7): E(y) = β0β1x
Use logarithms to develop an estimated regression equation for this model. (Round b0 to three decimal places and b1 to four decimal places.)
log(ŷ) =
(d)Do you prefer the estimated regression equation developed in part (b) or part (c)? Explain.
The model in part (b) is preferred because r2 is higher and the p-value is lower.
The model in part (c) is preferred because r2 is higher and the p-value is lower.
The model in part (b) is preferred because r2 is lower and the p-value is lower.
The model in part (c) is preferred because r2 is lower and the p-value is lower.
We input the given data set in R and use the summary(lm(.)) and
plot(.) functions to answer the given questions.
The R code is given below.
x=c(19,13,14,18,18,15,19,18,17,16,17,19,16,16,17)
y=c(2650,45,62,1025,375,102.5,1325,1575,475,250,375,4000,125,375,165)
plot(x,y,pch=16,xlab="rating",ylab="price")
x2=x^2
mb=lm(y~x+x2)
mb1=lm(log(y)~x)
summary(mb)
summary(mb1)
(a) The scatter diagram is given below.
Does a simple linear regression model appear to be appropriate?
No, there appears to be a curvilinear relationship between
the two variables.
(b) ŷ = 33357 + (-4513) x + 151.9 x2.
(c) log(ŷ) = (-5.515) + 0.6839 x.
(d) The model in part (c) is preferred because r2
(0.8486) is higher and the p-value (0.000001093) is
lower.
Get Answers For Free
Most questions answered within 1 hours.