Question

A car insurance company performed a study to determine whether an association exists between age and...

A car insurance company performed a study to determine whether an association exists between age and the frequency of car accidents. They obtained the following sample data.

under 25 25-45 over 45 total
number of 0 74 90 84 248
accidents in 1 19 8 12 39
past 3 years >1 7 2 4 13
total 100 100 100 300

Conduct a test, at the 5% significance level, to determine whether the data provide sufficient evidence to conclude that an association exists between age and frequency of car accidents.

(a) State the null and alternative hypotheses for this test.

(b) Calculate:

(i) the expected frequency, E31, for the number of accidents is > 1 and age is under 25 years?

(ii) the chi-square contribution, χ2, for the number of accidents is > 1 and age is under 25 years?

(iii) the p-value for this test?

(c) Given that the chi-square test statistic, χ2, is 9.273, state the conclusion for the test.

(d) The sample data was inputted into a statistical software for analysis. The output generated included the following warning: Warning: 3 cells (33.3%) have expected count less than 5. The minimum expected count is 4.33.

(i) Why was this warning displayed?

(ii) How does this affect the conclusion stated in part (c)?

(iii) Suggest how this problem could be resolved?

Homework Answers

Answer #1

Given table data is as below

MATRIX col1 col2 col3 TOTALS
row 1 74 90 84 248
row 2 19 8 12 39
row 3 7 2 4 13
TOTALS 100 100 100 N = 300

------------------------------------------------------------------

calculation formula for E table matrix

E-TABLE col1 col2 col3
row 1 row1*col1/N row1*col2/N row1*col3/N
row 2 row2*col1/N row2*col2/N row2*col3/N
row 3 row3*col1/N row3*col2/N row3*col3/N

------------------------------------------------------------------

expected frequecies calculated by applying E - table matrix formulae

E-TABLE col1 col2 col3
row 1 82.667 82.667 82.667
row 2 13 13 13
row 3 4.333 4.333 4.333

------------------------------------------------------------------

calculate chisquare test statistic using given observed frequencies, calculated expected frequencies from above

Oi Ei Oi-Ei (Oi-Ei)^2 (Oi-Ei)^2/Ei
74 82.667 -8.667 75.117 0.909
90 82.667 7.333 53.773 0.65
84 82.667 1.333 1.777 0.021
19 13 6 36 2.769
8 13 -5 25 1.923
12 13 -1 1 0.077
7 4.333 2.667 7.113 1.642
2 4.333 -2.333 5.443 1.256
4 4.333 -0.333 0.111 0.026
chisqr^2 o = 9.273

------------------------------------------------------------------

set up null vs alternative as

null, Ho: no association exists between age and frequency of car accidents

alternative, H1: exists a relation b/w age and frequency of car accidents OR an association exists between age and frequency of car accidents

level of significance, alpha = 0.05

from standard normal table, chi square value at right tailed, chisqr^2 alpha/2 =9.488

since our test is right tailed,reject Ho when chisqr^2 o > 9.488

we use test statistic chisqr^2 o = Σ(Oi-Ei)^2/Ei

from the table , chisqr^2 o = 9.273

critical value

the value of |chisqr^2 alpha| at los 0.05 with d.f (r-1)(c-1)= ( 3 -1 ) * ( 3 - 1 ) = 2 * 2 = 4 is 9.488

we got | chisqr^2| =9.273 & | chisqr^2 alpha | =9.488

make decision

hence value of | chisqr^2 o | < | chisqr^2 alpha | and here we do not reject Ho

chisqr^2 p_value =0.055

---------------

A.

null, Ho: no association exists between age and frequency of car accidents

alternative, H1: exists a relation b/w age and frequency of car accidents OR an association exists between age and frequency of car accidents

B.

(i) the expected frequency, E31 = 4.333 > 1 and age is under 25 years

(ii)test statistic: 9.273

(iii)p-value:0.055

C.

test statistic: 9.273, decision: do not reject Ho

D.

i)
Warning: 3 cells (33.3%) have expected count less than 5, because
three of the expected values are 4.33 <5, and chi square rules explains that
every provided cell observation values should be greater than 5
ii)
In chi-square test we compare observed frequency (that we measure directly
from the data) with the expected frequency. We calculate expected frequency
(in Chi-square for independence) forr each cell. The assumption suggests that no
cell should have expected frequency of less than 5. Or else. the test is not valid.
iii)
It is possible to ‘poor or ‘collapse’ categories into fewer. but this must only be
done if it is meaningful to group the data in this way. Else use Fisher’s exact
test.

Know the answer?
Your Answer:

Post as a guest

Your Name:

What's your source?

Earn Coins

Coins can be redeemed for fabulous gifts.

Not the answer you're looking for?
Ask your own homework help question
Similar Questions
A large corporation was interested in determining whether an association exists between commuting time of their...
A large corporation was interested in determining whether an association exists between commuting time of their employees and the level of stress-related problems observed on the job. A study of 116 assembly-line workers revealed the following: Commuting Time Stress High Moderate Low Under 15 min. 15 6 21 15 min. to 45 min. 9 9 32 Over 45 min. 21 7 5 At the 5% significance level, is there evidence of a relationship between commuting time and stress? H0:____________________________      HA:...
The table below shows the age and favourite type of music of 668 randomly selected people....
The table below shows the age and favourite type of music of 668 randomly selected people. Rock Pop Classical 15-25 50 85 73 25-35 68 91 60 35-45 90 74 77 Use a 5 percent level of significance to test the null hypothesis that age and preferred music type are independent. (a) State the null and alternative hypotheses for this test. [2 marks] (b) Calculate: the expected frequency, E21, for age 25-35 years and favourite music is Rock (ii) the...
Using α = 0.01, test that there was no relationship between survival and age category. Refer...
Using α = 0.01, test that there was no relationship between survival and age category. Refer to the output in part (a). State the null and alternative hypotheses. Report the value of the appropriate test statistic, the distribution of the test statistic under the null hypothesis, and the P-value of the test to answer the question. State your conclusion. Output from part A Contingency table results: Rows: Survival Columns: Bin(Age) Cell format Count (Row percent) (Column percent) (Percent of total)...
A study was done to investigate whether a relationship existed between HPV (human papillomavirus) status and...
A study was done to investigate whether a relationship existed between HPV (human papillomavirus) status and HIV infection status. HPV status and HIV infection status were obtained for 97 women. Table of hpv by hiv hpv hiv Frequency Expected Row Pct Seropositive/Symptomatic Seropositive/Asymptomatic Seronegative Total Positive 23 12.928 60.53 5 7.4433 13.16 10 17.629 26.32 38 Negative 10 20.072 16.95 14 11.557 23.73 35 27.371 59.32 59 Total 33 19 45 97 Statistic DF Value Prob Chi-Square 2 19.6478 <.0001...
market researcher tests whether watching a particular soap opera depends on the viewer's age. A random...
market researcher tests whether watching a particular soap opera depends on the viewer's age. A random sample of adult viewers of the The Young and the Restless is collected over a 30-day period; data are presented below:                                                Watched? Viewer's age                 Yes             No 18-25 years 33 40 26-40 78 55 41+ years 18 84 Part a: State the null hypothesis for a two-way chi-square test involving watch status (Yes/No) and viewer's age. [3 points] Part b: What is the expected...
(Q24-Q27) An insurance company has gathered the information regarding the number of accidents reported per day...
(Q24-Q27) An insurance company has gathered the information regarding the number of accidents reported per day over a period of 100 days. The data can be found on the fourth sheet labeled “Accidents” in the “INFO1020 Final Exam DataFile.xlsx”. With the data, you are conducting a goodness-of-fit test to see whether the number of accidents per day can have a Poisson distribution. 24.What is the expected frequency of exactly 2 accidents per day? 25.What is the Chi-square test statistics? Round...
(Q24-Q27) An insurance company has gathered the information regarding the number of accidents reported per day...
(Q24-Q27) An insurance company has gathered the information regarding the number of accidents reported per day over a period of 100 days. The data can be found on the fourth sheet labeled “Accidents” in the “INFO1020 Final Exam DataFile.xlsx”. With the data, you are conducting a goodness-of-fit test to see whether the number of accidents per day can have a Poisson distribution. 24.What is the expected frequency of exactly 2 accidents per day? 25.What is the Chi-square test statistics? Round...
Among drivers who have had car crashes in the last year, the data was collected by...
Among drivers who have had car crashes in the last year, the data was collected by age of driver and appears in the table below. If the ages all had the same crash rate we would expect the probability (percentage) shown in the table as well. (This is adjusted to account for the percentage of that age group in the driving population as a whole.) Using the methods that you learned with Chi-Square Goodness of Fit, test the data at...
5. Car Crashes and Age Brackets Among drivers who have had a car crash in the...
5. Car Crashes and Age Brackets Among drivers who have had a car crash in the last year, 88 are randomly selected and categorized by age, with the results listed in the accompanying table (based on data from the Insurance Information Institute). If all ages have the same crash rate, we would expect (because of the age distribution of licensed drivers) the given categories to have 16%, 44%, 27%, and 13% of the subjects, respectively. At the 0.05 significance level,...
A researcher records whether n=280n=280 children do or do not have asthma at age 13, and...
A researcher records whether n=280n=280 children do or do not have asthma at age 13, and then again records whether these same 280280 individuals have asthma 7 years later. Results are shown in the following table: Asthma at age 20? Asthma at age 13? Yes No Yes 85 17 No 5 173 1. What is the expected frequency of "asthma at 13" and "asthma at 20"? [Answer to 2 decimal places.] 2. What is the expected frequency of "asthma at...