Question

2) Airline accidents: According to the U.S. National Transportation Safety Board, the number of airline accidents...

2) Airline accidents: According to the U.S. National Transportation Safety Board, the number of airline accidents by year from 1983 to 2006 were 23, 16, 21, 24, 34, 30, 28, 24, 26, 18, 23, 23, 36, 37, 49, 50, 51, 56, 46, 41, 54, 30, 40, and 31.

a. For the sample data, compute the mean and its standard error (from the standard deviation), and the median.

b. Using R, compute bootstrap estimates of the mean, median and 25% trimmed mean with estimates of their standard errors, using B = 1000 resamples. (Use set.seed(1) to allow your results to be reproduced.) c.

Compare parts a and b. How do the estimates compare?

3) Consider a population that has a normal distribution with mean µ = 36, standard deviation σ = 8. a. The sampling distribution of ?̅ for samples of size 500 will have what distribution, mean and standard error? b. Use R to draw a random sample of size 500 from thus population. Conduct exploratory data analysis on your sample. c. Compute the bootstrap distribution for your sample and note the bootstrap mean and standard error. d. Compare the bootstrap distribution to the theoretical sampling distribution and create a table as given below. Mean Standard Deviation Population Sample distribution of ?̅ Sample Bootstrap distribution e. Repeat for sample of sizes n = 50 and n = 10. Carefully describe your observations about the effects of sample size on the bootstrap distribution.

4) Suppose we have a random sample from the Binomial distribution. Here X represents the number of successes in n trials with probability of success given by θ. Data: 6 5 5 5 7 4 ∼ binomial(8,θ) a. Estimate θ. Think what will be a parametric estimate for θ based on the given sample. b. Write out the R code to generate data of 100 parametric bootstrap samples and compute an 80% percentile confidence interval for θ. (Hint: You will generate random samples of size equal to the given sample size from a Binomial with n = 8 and using the estimate of θ that you get in

part a.)a. Estimate θ. Think what will be a parametric estimate for θ based on the given sample. b. Write out the R code to generate data of 100 parametric bootstrap samples and compute an 80% percentile confidence interval for θ. (Hint: You will generate random samples of size equal to the given sample size from a Binomial with n = 8 and using the estimate of θ that you get in part a.)

5) Will students wait times for the arrival of an instructor who is a full professor differ from the wait times for an instructor who is a graduate student? This question was investigated by counting how many minutes undergraduate students waiting in two small seminar classes, one taught by a full professor and one taught by a graduate student. Suppose we want to examine if the time waited is different for the full professor than for the graduate student. a. Propose a parametric approach to answer this question. Mention clearly all assumptions for such an approach b. Compare your approach in part a. to a bootstrap approach for a two-sample problem. Clearly state your conlusions in relation to this problem. Graduate Student instructor 9 11 14 14 16 19 37 Full Professor 28 23 18 16 15 15 13 31 31

Homework Answers

Answer #1

the number of airline accidents by year from 1983 to 2006 were

23, 16, 21, 24, 34, 30, 28, 24, 26, 18, 23, 23, 36, 37, 49, 50, 51, 56, 46, 41, 54, 30, 40, and 31.

a. mean of sample data is

x̄ = ( sum of observations ) / ( number of observations)

= ( 23+16+21+24+34+30+28+24+26+18+23+23+36+37+49+50+51+56+46+41+54+30+40+31)/24

= 811/24

= 33.7917

the average number of airline accidents by year 1983 to 2006 is 33.7917

standard error = σ/√n

  σ = √(3347.86/24)

= √(139.4941)

= 11.8107

standard error = (11.8107)/(√24)

= 2.4108

standard error of geven sample data is 2.4108

median : arrange the data in ascending order

16, 18, 21, 23, 23, 23, 24, 24, 26, 28, 30,30, 31, 34, 36, 37, 40, 41, 46, 49, 50, 51, 54, 56,

median = ( n+1)/2 th value

= (30+31)/2

= 30.5

median of given sample data is 30.5

Know the answer?
Your Answer:

Post as a guest

Your Name:

What's your source?

Earn Coins

Coins can be redeemed for fabulous gifts.

Not the answer you're looking for?
Ask your own homework help question
Similar Questions
Use R. Generate a random sample with n=15 random observations from an exponential distribution with mean=1....
Use R. Generate a random sample with n=15 random observations from an exponential distribution with mean=1. Calculate the sample median, which is an estimator of the population median. Use bootstrap (nonparametric, with B=1000) methods to estimate the variance of the estimator for the population median. use the Monte Carlo method, e.g. generate 1000 samples of size 15 to estimate the true variance of the median estimator. Compare and comment on your results.
2. Suppose the National Transportation Safety Board (NTSB) wants to examine the safety of compact cars,...
2. Suppose the National Transportation Safety Board (NTSB) wants to examine the safety of compact cars, midsize cars, and full-size cars. It collects a sample of three for each of the treatments (cars types). Using the hypothetical data provided below, test whether the mean pressure applied to the driver’s head during a crash test is significantly different for each type of car. Use ? = 5% to hand calculate and check your work in SPSS. Compact cars 6,6,7 Midsize cars...
This problem is to be done in R. When computing the bootstrap, there are two extremes...
This problem is to be done in R. When computing the bootstrap, there are two extremes we talked about in class: either completely enumerating all of the possible cases (which is computationally infeasible most of the time), or drawing of a few samples at random with possibility of repetition. In between these is an algorithm known as the Balanced Bootstrap. The algorithm goes as follows: Generate a list of B repetitions of each of the observations in our original data...
Use the programming language R to code the following project.. * Make sure you turn in...
Use the programming language R to code the following project.. * Make sure you turn in your code and answers from each question. (not the raw data). 1. Generate 1000 random samples of size 40 from the normal distribution with mean µ = 3 and standard deviation σ = 2. Compute 95% the confidence interval of 1000 samples and find the rate of confidence interval contains the true mean. What did you learn from this simulation study? 2. For each...
When σ is unknown and the sample is of size n ≥ 30, there are two...
When σ is unknown and the sample is of size n ≥ 30, there are two methods for computing confidence intervals for μ. Method 1: Use the Student's t distribution with d.f. = n − 1. This is the method used in the text. It is widely employed in statistical studies. Also, most statistical software packages use this method. Method 2: When n ≥ 30, use the sample standard deviation s as an estimate for σ, and then use the...
When ? is unknown and the sample is of size n?30, there are two methods for...
When ? is unknown and the sample is of size n?30, there are two methods for computing confidence intervals for ?. Method 1: Use the Student's t distribution with d.f.= n?1. This is the method used in the text. It is widely employed in statistical studies. Also, most statistical software packages use this method. Method 2: When n ? 30, use the sample standard deviation sas an estimate for ?, and then use the standard normal distribution. This method is...
Suppose your friend Joanna is running for class president. The proportion of individuals in a population...
Suppose your friend Joanna is running for class president. The proportion of individuals in a population of students who will vote for Joanna on election day is 60%. You plan to conduct a poll of size n and report X, the number of individuals in your poll who plan to vote for Joanna. You also plan to compute ?̂=??, the proportion of individuals in your poll who plan to vote for Joanna. a) Explain why X is a binomial random...
When σ is unknown and the sample is of size n ≥ 30, there are two...
When σ is unknown and the sample is of size n ≥ 30, there are two methods for computing confidence intervals for μ. Method 1: Use the Student's t distribution with d.f. = n − 1. This is the method used in the text. It is widely employed in statistical studies. Also, most statistical software packages use this method. Method 2: When n ≥ 30, use the sample standard deviation s as an estimate for σ, and then use the...
When σ is unknown and the sample is of size n ≥ 30, there are two...
When σ is unknown and the sample is of size n ≥ 30, there are two methods for computing confidence intervals for μ. Method 1: Use the Student's t distribution with d.f. = n − 1. This is the method used in the text. It is widely employed in statistical studies. Also, most statistical software packages use this method. Method 2: When n ≥ 30, use the sample standard deviation s as an estimate for σ, and then use the...
-0.021 0.029 -0.009 -0.002 0.002 -0.006 0.006 -0.064 0.023 0.031 Table 1: A sample of FIR...
-0.021 0.029 -0.009 -0.002 0.002 -0.006 0.006 -0.064 0.023 0.031 Table 1: A sample of FIR Your friend seek help from you to use your knowledge from TSTA602 to assist Bank X to do the following data analysis. Please write a report to answer all the following questions. (a). calculate (mannually) the sample mean, and sample standard de- viation for the sample in Table 1. (b). if we draw samples of sizes 10 many times and form a distribution of...