Question

Sandra is the manager of human resources at ABC Inc. As part of her yearly report to the CEO, she is required to present an analysis of the salaried employees. Because there are over 1,000 employees, she does not have the staff to gather information on each salaried employee, so she selects a random sample of 40. For each employee she records monthly salary, service at ABC (in months), age, gender (1=male, 0=female), and whether the employee has a technical or clerical job (technical=1, clerical=0).

The data for all 40 sampled employees are presented in the table below.

Please use Excel/Data Analysis or Stats to answer the following questions:

Which are the dependent and which are the independent variables? Is there a linear relationship between the dependent and each of the independent variables?

Which independent variable has the strongest correlation with the dependent variable? Which independent variable has the weakest correlation with the dependent variable? Does it appear there will be any problems with multicollinearity?

Employee Salary Service Age Gender Job
1        1,945.9 93 42 0 0
2        1,914.0 104 33 1 0
3        2,135.1 104 42 0 1
4        2,603.7 126 57 1 0
5        2,713.7 98 30 1 1
6        1,804.0 99 49 1 1
7        1,931.6 94 35 1 0
8        1,876.6 96 46 0 1
9        1,943.7 124 56 0 0
10        1,320.0 73 23 0 1
11        1,876.6 110 67 0 1
12        2,183.5 90 36 0 1
13        1,710.5 104 53 0 0
14        1,923.9 81 29 0 0
15        2,261.6 106 45 1 0
16        1,901.9 113 55 0 1
17        2,404.6 129 46 1 1
18        2,043.8 97 39 0 0
19        2,000.9 101 43 1 1
20        1,485.0 91 35 0 1
21        2,233.0 100 40 1 0
22        2,805.0 123 59 1 0
23        1,698.4 88 30 0 0
24        1,942.6 117 60 1 1
25        2,130.7 107 45 1 1
26        1,860.1 105 32 0 1
27        1,785.3 86 33 0 0
28        1,970.1 131 56 0 1
29        2,201.1 95 30 1 1
30        2,061.4 98 47 0 0
31        2,211.0 120 60 1 1
32        1,870.0 87 29 0 0
33        1,844.7 100 65 0 0
34        2,082.3 105 27 0 1
35        2,136.2 86 37 1 0
36        1,735.8 93 39 1 1
37        2,871.0 97 47 1 0
38        1,939.3 100 42 0 0
39        2,075.7 105 40 1 1
40        2,436.5 127 49 0 1

Homework Answers

Answer #1

Which are the dependent and which are the independent variables? Is there a linear relationship between the dependent and each of the independent variables?

The dependent variable is Salary. The independent variables are Service, Age, Gender, and Job.

Because there are several independent variables, this is a multiple linear regression problem.

To check for a linear relationship between Salary and each independent variable, test the significance of each Pearson correlation:

H0 : There is no linear relationship between the dependent variable and the independent variable (correlation = 0).

H1 : There is a linear relationship between the dependent variable and the independent variable (correlation is not 0).

Assume alpha = level of significance = 0.05.

This test can be run in MINITAB.

Steps :

Enter the data into a MINITAB worksheet --> Stat --> Basic Statistics --> Correlation --> Variables : select all five variables together --> check Display p-values --> OK

————— 09-12-2018 20:28:40 ————————————————————

Correlation: Salary, Service, Age, Gender, Job

          Salary   Service       Age    Gender
Service    0.463
           0.003

Age        0.234     0.700
           0.147     0.000

Gender     0.495     0.198     0.079
           0.001     0.220     0.629

Job       -0.100     0.202     0.013     0.055
           0.540     0.211     0.938     0.734

Cell Contents: Pearson correlation
               P-Value
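
For readers without MINITAB, the correlation matrix can be recomputed from the table in the question. The following is a minimal sketch in plain Python, assuming the 40 rows printed in the question are the same data the MINITAB run used. The names ROWS, COLUMNS, and pearson are illustrative and not part of the original answer. It computes the Pearson correlations only, not the p-values.

```python
import math

COLUMNS = ("Salary", "Service", "Age", "Gender", "Job")

# (salary, service, age, gender, job) for employees 1..40, copied from the table.
ROWS = [
    (1945.9, 93, 42, 0, 0), (1914.0, 104, 33, 1, 0),
    (2135.1, 104, 42, 0, 1), (2603.7, 126, 57, 1, 0),
    (2713.7, 98, 30, 1, 1), (1804.0, 99, 49, 1, 1),
    (1931.6, 94, 35, 1, 0), (1876.6, 96, 46, 0, 1),
    (1943.7, 124, 56, 0, 0), (1320.0, 73, 23, 0, 1),
    (1876.6, 110, 67, 0, 1), (2183.5, 90, 36, 0, 1),
    (1710.5, 104, 53, 0, 0), (1923.9, 81, 29, 0, 0),
    (2261.6, 106, 45, 1, 0), (1901.9, 113, 55, 0, 1),
    (2404.6, 129, 46, 1, 1), (2043.8, 97, 39, 0, 0),
    (2000.9, 101, 43, 1, 1), (1485.0, 91, 35, 0, 1),
    (2233.0, 100, 40, 1, 0), (2805.0, 123, 59, 1, 0),
    (1698.4, 88, 30, 0, 0), (1942.6, 117, 60, 1, 1),
    (2130.7, 107, 45, 1, 1), (1860.1, 105, 32, 0, 1),
    (1785.3, 86, 33, 0, 0), (1970.1, 131, 56, 0, 1),
    (2201.1, 95, 30, 1, 1), (2061.4, 98, 47, 0, 0),
    (2211.0, 120, 60, 1, 1), (1870.0, 87, 29, 0, 0),
    (1844.7, 100, 65, 0, 0), (2082.3, 105, 27, 0, 1),
    (2136.2, 86, 37, 1, 0), (1735.8, 93, 39, 1, 1),
    (2871.0, 97, 47, 1, 0), (1939.3, 100, 42, 0, 0),
    (2075.7, 105, 40, 1, 1), (2436.5, 127, 49, 0, 1),
]


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


# One list of values per column, then every pairwise correlation.
data = {name: [row[i] for row in ROWS] for i, name in enumerate(COLUMNS)}
corr = {(a, b): pearson(data[a], data[b]) for a in COLUMNS for b in COLUMNS}

# Print the lower triangle in the same layout as the MINITAB output.
print(" " * 8 + "".join(f"{name:>9}" for name in COLUMNS[:-1]))
for i, row_name in enumerate(COLUMNS[1:], start=1):
    cells = "".join(f"{corr[(row_name, col)]:9.3f}" for col in COLUMNS[:i])
    print(f"{row_name:<8}{cells}")
```

If the table matches the data used in the answer, the printed values should agree with the MINITAB correlations up to rounding.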

Conclusion :

The p-value for salary and service is 0.003, which is less than 0.05.

P-value < alpha

Reject H0 at the 5% level of significance.

There is a significant linear relationship between salary and service.

The p-value for salary and age is 0.147, which is greater than 0.05.

Fail to reject H0 at the 5% level of significance.

There is no significant evidence of a linear relationship between salary and age.

The p-value for salary and gender is 0.001, which is less than 0.05.

Reject H0 at the 5% level of significance.

There is a significant linear relationship between salary and gender.

The p-value for salary and job is 0.540, which is greater than 0.05.

Fail to reject H0 at the 5% level of significance.

There is no significant evidence of a linear relationship between salary and job.
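
The same decisions can be checked by hand with the t statistic for a correlation, t = r * sqrt(n - 2) / sqrt(1 - r^2), compared against the two-sided critical value of Student's t with n - 2 = 38 degrees of freedom. The sketch below is illustrative only: the correlations are the ones reported in the MINITAB output, the critical value 2.024 is taken from a standard t table, and the names t_statistic and decisions are hypothetical.

```python
import math

N = 40               # sample size from the question
T_CRITICAL = 2.024   # two-sided 5% critical value of t with N - 2 = 38 df (t table)

# Correlations with Salary as reported in the MINITAB output.
R_WITH_SALARY = {"Service": 0.463, "Age": 0.234, "Gender": 0.495, "Job": -0.100}


def t_statistic(r, n):
    """t statistic for testing H0: correlation = 0 with n observations."""
    return r * math.sqrt(n - 2) / math.sqrt(1 - r * r)


decisions = {}
for name, r in R_WITH_SALARY.items():
    t = t_statistic(r, N)
    # Reject H0 when |t| exceeds the critical value.
    decisions[name] = "reject H0" if abs(t) > T_CRITICAL else "fail to reject H0"
    print(f"{name:<8} r = {r:6.3f}   t = {t:6.2f}   {decisions[name]}")
```

Comparing |t| with the critical value is equivalent to comparing the p-value with alpha, so the decisions match the conclusions above.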

Which independent variable has the strongest correlation with the dependent variable? Which independent variable has the weakest correlation with the dependent variable? Does it appear there will be any problems with multicollinearity?

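From the correlation output, Gender has the strongest correlation with Salary (r = 0.495), followed closely by Service (r = 0.463). Job has the weakest correlation with Salary (r = -0.100).

Among the independent variables, the only sizeable correlation is between Service and Age (r = 0.700). All other pairs of independent variables have correlations below 0.21 in absolute value. A correlation of 0.70 is large enough to suspect multicollinearity, so it should be checked formally.

The usual measure of multicollinearity is the VIF, or variance inflation factor.

We can find the VIF in MINITAB.

Steps :

Enter the data into a MINITAB worksheet --> Stat --> Regression --> Regression --> Fit Regression Model --> Responses : Salary --> Continuous predictors : select all the independent variables --> OK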

Coefficients

Term        Coef   SE Coef  T-Value  P-Value   VIF
Constant     895       319     2.81    0.008
Service    13.09      4.38     2.99    0.005  2.20
Age        -5.36      5.17    -1.04    0.307  2.04
Gender     263.4      82.7     3.18    0.003  1.05
Job       -148.7      83.6    -1.78    0.084  1.08
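
To make the coefficient table concrete, the sketch below reads it as the fitted equation Salary = 895 + 13.09 Service - 5.36 Age + 263.4 Gender - 148.7 Job and shows that each T-Value is the coefficient divided by its standard error. It is only an illustration: the numbers are the rounded values from the MINITAB table, and the names t_value and predict_salary are hypothetical.

```python
# Coefficients and standard errors as reported (rounded) in the MINITAB table.
COEF = {"Constant": 895.0, "Service": 13.09, "Age": -5.36, "Gender": 263.4, "Job": -148.7}
SE = {"Constant": 319.0, "Service": 4.38, "Age": 5.17, "Gender": 82.7, "Job": 83.6}


def t_value(term):
    """T-Value of a term: coefficient divided by its standard error."""
    return COEF[term] / SE[term]


def predict_salary(service, age, gender, job):
    """Fitted monthly salary from the regression equation."""
    return (COEF["Constant"]
            + COEF["Service"] * service
            + COEF["Age"] * age
            + COEF["Gender"] * gender
            + COEF["Job"] * job)


for term in COEF:
    print(f"{term:<9} t = {t_value(term):6.2f}")

# Employee 1 in the table: 93 months of service, age 42, female, clerical.
print(f"Fitted salary for employee 1: {predict_salary(93, 42, 0, 0):.2f}")
```

For employee 1 the fitted salary is about 1,887, compared with the recorded 1,945.9. Because the coefficients are rounded, results can differ slightly from MINITAB's own fitted values.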

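All four VIF values (1.05 to 2.20) are well below 10, the usual rule-of-thumb cutoff, so multicollinearity does not appear to be a problem, even though Service and Age are correlated (r = 0.700).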
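
The VIF of a predictor equals 1 / (1 - R^2), where R^2 comes from regressing that predictor on the other predictors. Equivalently, the VIFs are the diagonal entries of the inverse of the predictors' correlation matrix. The sketch below uses that second form with the rounded correlations from the MINITAB correlation output, so it is a check under that assumption and not MINITAB's own computation. The helper invert is a hypothetical plain-Python Gauss-Jordan routine.

```python
PREDICTORS = ("Service", "Age", "Gender", "Job")

# Predictor-by-predictor Pearson correlations as reported in the MINITAB output.
R = [
    [1.000, 0.700, 0.198, 0.202],
    [0.700, 1.000, 0.079, 0.013],
    [0.198, 0.079, 1.000, 0.055],
    [0.202, 0.013, 0.055, 1.000],
]


def invert(matrix):
    """Invert a square matrix by Gauss-Jordan elimination with partial pivoting."""
    n = len(matrix)
    # Augment the matrix with the identity on the right.
    aug = [[float(v) for v in row] + [1.0 if i == j else 0.0 for j in range(n)]
           for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        scale = aug[col][col]
        aug[col] = [v / scale for v in aug[col]]
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [a - factor * b for a, b in zip(aug[r], aug[col])]
    # The right half now holds the inverse.
    return [row[n:] for row in aug]


inverse = invert(R)
vif = {name: inverse[i][i] for i, name in enumerate(PREDICTORS)}

for name, value in vif.items():
    print(f"{name:<8} VIF = {value:.2f}")
```

The results should be close to the VIF column in the coefficient table, with small differences because the input correlations are rounded to three decimals.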
