Question

Sandra is the manager of human resources at ABC Inc. As part of her yearly report to the CEO she is required to present an analysis of the salaried employees. Because there are over 1,000 employees, she does not have the staff to gather information on each salaried employee, so she selects a random sample of 40. For each employee she records monthly salary, service at ABC (in months), age, gender (1=male, 0=female), and whether the employee has a technical or clerical job (technical=1, clerical=0).

The data referring to all 40 employees is presented in the table below.

Please use Excel/Data Analysis or Stats to answer the following questions:

Which are the dependent and which are the independent variables? Is there a linear relationship between the dependent and each of the independent variables?

Which independent variable has the strongest correlation with the dependent variable? Which independent variable has the weakest correlation with the dependent variable? Does it appear there will be any problems with multicollinearity?

Employee Salary Service Age Gender Job
1        1,945.9 93 42 0 0
2        1,914.0 104 33 1 0
3        2,135.1 104 42 0 1
4        2,603.7 126 57 1 0
5        2,713.7 98 30 1 1
6        1,804.0 99 49 1 1
7        1,931.6 94 35 1 0
8        1,876.6 96 46 0 1
9        1,943.7 124 56 0 0
10        1,320.0 73 23 0 1
11        1,876.6 110 67 0 1
12        2,183.5 90 36 0 1
13        1,710.5 104 53 0 0
14        1,923.9 81 29 0 0
15        2,261.6 106 45 1 0
16        1,901.9 113 55 0 1
17        2,404.6 129 46 1 1
18        2,043.8 97 39 0 0
19        2,000.9 101 43 1 1
20        1,485.0 91 35 0 1
21        2,233.0 100 40 1 0
22        2,805.0 123 59 1 0
23        1,698.4 88 30 0 0
24        1,942.6 117 60 1 1
25        2,130.7 107 45 1 1
26        1,860.1 105 32 0 1
27        1,785.3 86 33 0 0
28        1,970.1 131 56 0 1
29        2,201.1 95 30 1 1
30        2,061.4 98 47 0 0
31        2,211.0 120 60 1 1
32        1,870.0 87 29 0 0
33        1,844.7 100 65 0 0
34        2,082.3 105 27 0 1
35        2,136.2 86 37 1 0
36        1,735.8 93 39 1 1
37        2,871.0 97 47 1 0
38        1,939.3 100 42 0 0
39        2,075.7 105 40 1 1
40        2,436.5 127 49 0 1

Homework Answers

Answer #1

Which are the dependent and which are the independent variables? Is there a linear relationship between the dependent and each of the independent variables?

The dependent variable is salary; the independent variables are service, age, gender, and job.

This is a multiple linear regression problem. To check whether salary has a linear relationship with each independent variable, we test each pairwise correlation.

For each independent variable, the hypotheses are:

H0 : There is no linear relationship between salary and the independent variable.

H1 : There is a linear relationship between salary and the independent variable.

Assume alpha = level of significance = 0.05.

We can run this test in MINITAB.

Steps:

Enter the data into a MINITAB worksheet --> Stat --> Basic Statistics --> Correlation --> Variables: select all the variables together --> Display p-values --> OK

————— 09-12-2018 20:28:40 ————————————————————

Correlation: Salary, Service, Age, Gender, Job

           Salary  Service     Age  Gender
Service     0.463
            0.003

Age         0.234    0.700
            0.147    0.000

Gender      0.495    0.198   0.079
            0.001    0.220   0.629

Job        -0.100    0.202   0.013   0.055
            0.540    0.211   0.938   0.734

Cell Contents: Pearson correlation
               P-Value
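For readers without MINITAB, the Pearson correlations can be sketched in plain Python. The block below is only an illustration: the data are transcribed from the table in the question, and the helper names (`pearson`, `load_columns`, `correlation_matrix`) are made up for this example. If the transcription is right, the printed values should come out close to the MINITAB output, but that has not been verified here.

```python
import math

# Sample data transcribed from the question's table, one employee per line:
# salary, service (months), age, gender (1=male), job (1=technical)
ROWS = """
1945.9 93 42 0 0
1914.0 104 33 1 0
2135.1 104 42 0 1
2603.7 126 57 1 0
2713.7 98 30 1 1
1804.0 99 49 1 1
1931.6 94 35 1 0
1876.6 96 46 0 1
1943.7 124 56 0 0
1320.0 73 23 0 1
1876.6 110 67 0 1
2183.5 90 36 0 1
1710.5 104 53 0 0
1923.9 81 29 0 0
2261.6 106 45 1 0
1901.9 113 55 0 1
2404.6 129 46 1 1
2043.8 97 39 0 0
2000.9 101 43 1 1
1485.0 91 35 0 1
2233.0 100 40 1 0
2805.0 123 59 1 0
1698.4 88 30 0 0
1942.6 117 60 1 1
2130.7 107 45 1 1
1860.1 105 32 0 1
1785.3 86 33 0 0
1970.1 131 56 0 1
2201.1 95 30 1 1
2061.4 98 47 0 0
2211.0 120 60 1 1
1870.0 87 29 0 0
1844.7 100 65 0 0
2082.3 105 27 0 1
2136.2 86 37 1 0
1735.8 93 39 1 1
2871.0 97 47 1 0
1939.3 100 42 0 0
2075.7 105 40 1 1
2436.5 127 49 0 1
"""

NAMES = ["Salary", "Service", "Age", "Gender", "Job"]


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def load_columns(text):
    """Parse the whitespace-separated rows into one list per variable."""
    rows = [[float(v) for v in line.split()] for line in text.strip().splitlines()]
    return {name: [row[i] for row in rows] for i, name in enumerate(NAMES)}


def correlation_matrix(columns):
    """Correlation for every ordered pair of variables, keyed by (row, column)."""
    return {(a, b): pearson(columns[a], columns[b]) for a in NAMES for b in NAMES}


columns = load_columns(ROWS)
corr = correlation_matrix(columns)
for name in NAMES[1:]:
    print(f"Salary vs {name}: r = {corr[('Salary', name)]:.3f}")
```

The same dictionary also holds the correlations between the independent variables (for example `corr[("Service", "Age")]`), which are the entries to inspect for multicollinearity.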

Conclusion:

The p-value for salary and service is 0.003, which is less than alpha = 0.05.

Reject H0 at the 5% level of significance.

There is a significant linear relationship between salary and service.

The p-value for salary and age is 0.147, which is greater than 0.05.

Fail to reject H0 at the 5% level of significance.

There is no significant linear relationship between salary and age.

The p-value for salary and gender is 0.001, which is less than 0.05.

Reject H0 at the 5% level of significance.

There is a significant linear relationship between salary and gender.

The p-value for salary and job is 0.540, which is greater than 0.05.

Fail to reject H0 at the 5% level of significance.

There is no significant linear relationship between salary and job.
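The four decisions can be cross-checked by hand, since the test of a zero correlation uses t = r * sqrt(n - 2) / sqrt(1 - r^2) with n - 2 degrees of freedom. The sketch below is an illustration only: the r values are copied from the MINITAB output, the critical value 2.024 is the two-sided 5% table value for 38 degrees of freedom, and the function names are made up for this example.

```python
import math

N = 40              # sample size from the question
T_CRITICAL = 2.024  # two-sided 5% critical value of t with 38 df (table value)

# Pearson correlations with Salary, copied from the MINITAB output
R_WITH_SALARY = {"Service": 0.463, "Age": 0.234, "Gender": 0.495, "Job": -0.100}


def t_statistic(r, n):
    """t statistic for testing H0: population correlation = 0."""
    return r * math.sqrt(n - 2) / math.sqrt(1 - r * r)


def is_significant(r, n, t_crit=T_CRITICAL):
    """True when |t| exceeds the critical value, i.e. H0 is rejected."""
    return abs(t_statistic(r, n)) > t_crit


for name, r in R_WITH_SALARY.items():
    t = t_statistic(r, N)
    verdict = "reject H0" if is_significant(r, N) else "fail to reject H0"
    print(f"{name:8s} r = {r:+.3f}  t = {t:+.2f}  {verdict}")
```

Comparing |t| with a critical value gives the same reject or fail-to-reject decision as comparing the p-value with alpha; it just avoids needing the t distribution's CDF.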

Which independent variable has the strongest correlation with the dependent variable? Which independent variable has the weakest correlation with the dependent variable? Does it appear there will be any problems with multicollinearity?

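From the correlation matrix above, gender has the strongest correlation with salary (r = 0.495), followed closely by service (r = 0.463). Job has the weakest correlation with salary (r = -0.100).

Among the independent variables, the largest correlation is between service and age (r = 0.700), so this is the pair most likely to cause multicollinearity.

To check for multicollinearity, we use the variance inflation factor (VIF) of each independent variable.

We can find the VIFs in MINITAB.

Steps:

Enter the data into a MINITAB worksheet --> Stat --> Regression --> Regression --> Fit Regression Model --> Responses: Salary --> Continuous predictors: select all the independent variables --> OK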

Coefficients

Term        Coef  SE Coef  T-Value  P-Value   VIF
Constant     895      319     2.81    0.008
Service    13.09     4.38     2.99    0.005  2.20
Age        -5.36     5.17    -1.04    0.307  2.04
Gender     263.4     82.7     3.18    0.003  1.05
Job       -148.7     83.6    -1.78    0.084  1.08

All VIFs lie between 1.05 and 2.20, well below the common cutoff of 10, so multicollinearity does not appear to be a problem.
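For context, the VIF of a predictor is 1 / (1 - R^2), where R^2 comes from regressing that predictor on the other predictors. The short sketch below (an illustration with made-up function names, using only numbers reported in the MINITAB output) inverts that formula to show how much of each predictor is explained by the others, and compares the result with what the service/age correlation of 0.700 alone would imply.

```python
def vif_from_r2(r2):
    """VIF of a predictor whose regression on the other predictors has this R^2."""
    return 1.0 / (1.0 - r2)


def r2_from_vif(vif):
    """Invert the VIF formula: share of a predictor explained by the others."""
    return 1.0 - 1.0 / vif


# VIF values copied from the MINITAB coefficients table
REPORTED_VIF = {"Service": 2.20, "Age": 2.04, "Gender": 1.05, "Job": 1.08}

for name, vif in REPORTED_VIF.items():
    print(f"{name:8s} VIF = {vif:.2f}  implied R^2 = {r2_from_vif(vif):.3f}")

# With only two predictors, R^2 is the squared pairwise correlation.
# Service and Age correlate at 0.700 in the MINITAB output:
pairwise_vif = vif_from_r2(0.700 ** 2)
print(f"VIF implied by r = 0.700 alone: {pairwise_vif:.2f}")
```

The pairwise figure is slightly below the reported VIFs for service and age, which is what one would expect, because the reported values also account for gender and job.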
