Question

if we have a sample size after removing the outliers we predicted amodel by 70% of...

if we have a sample size after removing the outliers we predicted amodel by 70% of the data and 30% of the data for R2 ,the question that i want to ask when we want to make statistical analeses for verification of the data (we use the total sample after removing the outliers in colmogorove smirnov z test , error analeses, normality plot (frequency with regression standardized residual),normal p-p plot or we use 70% of the data for previous analeses steps

Homework Answers

Answer #1

To build up the model or doing any statistical analysis we split up the whole data in two parts viz training and validation part. Training part have 70% of the data and validation part have 30% of the data. We use to build model base on training data and validate it that is cheak how much our model fits good based on validation data. We do not use the whole sample for the statistical analysis or the measures mentioned in question, it is done in training part of the data and how good the model is or how good it predicts is done by validation data. But if the sample size is less than 500 we do not split the data, we then take the whole data for analysis steps.

Know the answer?
Your Answer:

Post as a guest

Your Name:

What's your source?

Earn Coins

Coins can be redeemed for fabulous gifts.

Not the answer you're looking for?
Ask your own homework help question
Similar Questions
Sixth-Grade Reading Test Scores    79         74         78         70         80&nb
Sixth-Grade Reading Test Scores    79         74         78         70         80         72         75         80         83         80        74         73         70         74         81         75         81         82         76         71         76         74         81         74         74         77         76         74         80         76        Refer to the data in Exercise 1 above. Perform a Kolmogorov-Smirnov one-sample test to examine the normality of this sample of scores. Your response should be organized according to the 8 steps in hypothesis testing. Report your findings. (10 points) Hypothesis...
For the following experiment/question, pick the most appropriate statistical test. You have the following statistical tests...
For the following experiment/question, pick the most appropriate statistical test. You have the following statistical tests as choices: some may be used more than once, others not at all.  Assume homogeneity of variance (where applicable) and the validity of parametric tests (where applicable), unless something is directly stated (e.g., “the data are not at all normal”) or otherwise indicated (viz., by the inspection of the data) which would indicate a strong and obvious violation of an assumption. This means you must...
For the following experiment/question, pick the most appropriate statistical test. You have the following statistical tests...
For the following experiment/question, pick the most appropriate statistical test. You have the following statistical tests as choices: some may be used more than once, others not at all. Assume homogeneity of variance (where applicable) and the validity of parametric tests (where applicable), unless something is directly stated (e.g., “the data are not at all normal”) or otherwise indicated (viz., by the inspection of the data) which would indicate a strong and obvious violation of an assumption. This means you...
Statistical Analysis for Business Applications I Consider the following data representing the total time (in hours)...
Statistical Analysis for Business Applications I Consider the following data representing the total time (in hours) a student spent on reviewing for the Stat final exam and the actual score on the final. The sample of 10 students was taken from a class and the following answers were reported. time score 0 23 4 30 5 32 7 50 8 45 10 55 12 60 15 70 18 80 20 100 Part 1: Use the formulas provided on the 3rd...
For the following experiments/questions, pick the most appropriate statistical test. You have the following statistical tests...
For the following experiments/questions, pick the most appropriate statistical test. You have the following statistical tests as choices: some may be used more than once, others not at all.  Assume homogeneity of variance (where applicable) and the validity of parametric tests (where applicable), unless something is directly stated (e.g., “the data are not at all normal”) or otherwise indicated (viz., by the inspection of the data) which would indicate a strong and obvious violation of an assumption. This means you must...
(1) A Chi-squared test is typically used to test for any of the following except which...
(1) A Chi-squared test is typically used to test for any of the following except which of the following? (A) If a mathematical model accurately predicts our observed frequencies of data values. (B) If a mathematical model accurately predicts the total number of observed data values. (C) If a mathematical model accurately predicts the pattern of our observed data values. (D) Whether two factors present in a population are independent of one another. (E) Whether a series of populations experience...
The manager of an amusement park would like to be able to predict daily attendance to...
The manager of an amusement park would like to be able to predict daily attendance to develop more accurate plans about how much food to order and how many ride operators to hire. After some consideration, he decided that the following three factors are critical: Yesterday’s attendance Weekday or weekend (1 if weekend, 0 if a weekday) Predicted weather Rain forecast ( 1 if the forecast for rain, 0 if not) Sun   ( 1 if mostly sunny, 0 if not)...
I. Solve the following problem: For the following data: 1, 1, 2, 2, 3, 3, 3,...
I. Solve the following problem: For the following data: 1, 1, 2, 2, 3, 3, 3, 3, 4, 4, 5, 6 n = 12 b) Calculate 1) the average or average 2) quartile-1 3) quartile-2 or medium 4) quartile-3 5) Draw box diagram (Box & Wisker) II. PROBABILITY 1. Answer the questions using the following contingency table, which collects the results of a study to 400 customers of a store where you want to analyze the payment method. _______B__________BC_____ A...
1. A city official claims that the proportion of all commuters who are in favor of...
1. A city official claims that the proportion of all commuters who are in favor of an expanded public transportation system is 50%. A newspaper conducts a survey to determine whether this proportion is different from 50%. Out of 225 randomly chosen commuters, the survey finds that 90 of them reply yes when asked if they support an expanded public transportation system. Test the official’s claim at α = 0.05. 2. A survey of 225 randomly chosen commuters are asked...
1.The sample mean is an unbiased estimator for the population mean. This means: The sample mean...
1.The sample mean is an unbiased estimator for the population mean. This means: The sample mean always equals the population mean. The average sample mean, over all possible samples, equals the population mean. The sample mean will only vary a little from the population mean. The sample mean has a normal distribution. 2.Which of the following statements is CORRECTabout the sampling distribution of the sample mean: The standard error of the sample mean will decrease as the sample size increases....
ADVERTISEMENT
Need Online Homework Help?

Get Answers For Free
Most questions answered within 1 hours.

Ask a Question
ADVERTISEMENT