Question

Here is the data file Stat7_prob3.txt:

"FATALS","CUTTING"
270,15692
183,16198
319,17235
103,18463
149,18959
124,19103
62,19618
298,20436
330,21229
486,18660
302,17551
373,17466
187,17388
347,15261
168,14731
234,14237
68,13216
162,12017
27,11845
40,11905
26,11881
41,11974
116,11892
84,11810
43,12076
292,12342
89,12608
148,13049
166,11656
32,13305
72,13390
27,13625
154,13865
44,14445
3,14424
3,14315
153,13761
11,12471
9,10960
17,9218
2,9054
5,9218
63,8817
41,7744
10,6907
3,6440
26,6021
52,5561
31,5309
3,5320
19,4784
10,4311
12,3663
88,3060
0,2779
41,2623
2,2058
5,1890
2,1535
0,1515
0,1595
23,1803
4,1495
0,1432

Here is the question:

Please use R (RStudio) and provide all the R code and R output. Please answer all the questions (a and b). Pay attention to everything in bold. Show all work!

The file Stat7_prob3.txt contains data on the following two variables

  • FATALS: the annual number of fatalities from gas and dust explosions in coal mines for years 1915 to 1978.

  • CUTTING: the number of cutting machines in use

(a) Fit the regression model using FATALS as the dependent variable and CUTTING as the independent variable.

(b) Using appropriate residual plots and formal tests, investigate whether any assumptions are violated. Do any assumptions of the linear regression model appear to be violated? If so, which one (or ones)? (Hint: A plot of residuals versus fitted values can be used to check linearity, zero mean, and constant variance. A normal probability plot of the residuals can be used to check normality. There are also formal tests for the constant variance and normality assumptions that you can run in R.)
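As a starting point for parts (a) and (b), here is a minimal sketch in base R. It is one possible approach, not a worked answer. The data frame name `mines` and the other variable names are illustrative. The values are typed in from the listing above so the script runs without the file; with Stat7_prob3.txt in the working directory, `read.csv()` would load the same columns. The constant-variance check is the studentized (Koenker) form of the Breusch-Pagan test computed by hand, and `lmtest::bptest()` is a packaged alternative if the lmtest package is installed. Shapiro-Wilk is used as the normality test, which is an assumption about which formal test the course has in mind.

```r
# Data transcribed from the Stat7_prob3.txt listing (64 rows, years 1915 to 1978).
# With the file on disk: mines <- read.csv("Stat7_prob3.txt")
mines <- data.frame(
  FATALS = c(270, 183, 319, 103, 149, 124, 62, 298,
             330, 486, 302, 373, 187, 347, 168, 234,
             68, 162, 27, 40, 26, 41, 116, 84,
             43, 292, 89, 148, 166, 32, 72, 27,
             154, 44, 3, 3, 153, 11, 9, 17,
             2, 5, 63, 41, 10, 3, 26, 52,
             31, 3, 19, 10, 12, 88, 0, 41,
             2, 5, 2, 0, 0, 23, 4, 0),
  CUTTING = c(15692, 16198, 17235, 18463, 18959, 19103, 19618, 20436,
              21229, 18660, 17551, 17466, 17388, 15261, 14731, 14237,
              13216, 12017, 11845, 11905, 11881, 11974, 11892, 11810,
              12076, 12342, 12608, 13049, 11656, 13305, 13390, 13625,
              13865, 14445, 14424, 14315, 13761, 12471, 10960, 9218,
              9054, 9218, 8817, 7744, 6907, 6440, 6021, 5561,
              5309, 5320, 4784, 4311, 3663, 3060, 2779, 2623,
              2058, 1890, 1535, 1515, 1595, 1803, 1495, 1432)
)

# (a) Simple linear regression of FATALS on CUTTING.
fit <- lm(FATALS ~ CUTTING, data = mines)
print(summary(fit))

# (b) Residual plots: residuals vs fitted values, then a normal Q-Q plot.
res <- resid(fit)
plot(fitted(fit), res,
     xlab = "Fitted values", ylab = "Residuals",
     main = "Residuals vs fitted")
abline(h = 0, lty = 2)  # reference line at zero
qqnorm(res)
qqline(res)

# Formal normality test on the residuals (Shapiro-Wilk).
sw <- shapiro.test(res)
print(sw)

# Formal constant-variance test: studentized Breusch-Pagan by hand.
# Regress squared residuals on the predictor; the statistic n * R^2 is
# compared with a chi-square distribution with 1 degree of freedom.
aux <- lm(I(res^2) ~ CUTTING, data = mines)
bp_stat <- nrow(mines) * summary(aux)$r.squared
bp_pval <- pchisq(bp_stat, df = 1, lower.tail = FALSE)
cat("Studentized Breusch-Pagan statistic:", bp_stat,
    " p-value:", bp_pval, "\n")
```

In the residuals-versus-fitted plot, a curved trend suggests non-linearity and a fan or funnel shape suggests non-constant variance. In the Q-Q plot, points that bend away from the reference line suggest non-normal errors. Small p-values from the two tests point the same way as those visual signs.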
