Question

If you are modeling count data, explain why it is not sufficient to analyze ordinary raw residuals, (y_i − μ̂_i), as you would for ordinary linear models.

Homework Answers

Answer #1

Counts are discrete, not continuous, and cannot be negative. Applying an ordinary linear regression model to such data raises two problems, and for the same reasons it is not sufficient to analyze ordinary raw residuals:

First, count data are often positively skewed, with many observations equal to 0. With so many zeros in the data set, no transformation can turn the skewed distribution into a normal one.

Second, a linear regression model can easily produce negative predicted values, which are impossible for counts.
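
A small simulation can make both points concrete. This is only a sketch under stated assumptions: the counts are Poisson, the mean is log-linear in a single made-up predictor x with arbitrary coefficients, and the true mean stands in for the fitted value μ̂_i. For Poisson counts the variance equals the mean, so the spread of the raw residuals grows with μ̂_i. Pearson residuals, (y_i − μ̂_i) / sqrt(μ̂_i), are one standard rescaling that corrects for this.

```python
import numpy as np

rng = np.random.default_rng(0)  # fixed seed, arbitrary choice

# Hypothetical setup: one predictor x and a log-linear Poisson mean.
x = rng.uniform(0.0, 2.0, size=5000)
mu = np.exp(0.5 + 1.5 * x)      # assumed coefficients, for illustration only
y = rng.poisson(mu)

# Raw residuals versus Pearson residuals (the true mean stands in for mu-hat).
raw = y - mu
pearson = raw / np.sqrt(mu)     # divide by the Poisson standard deviation

# Compare the residual spread for small and large means.
low = mu < np.median(mu)
high = ~low
raw_low, raw_high = raw[low].std(), raw[high].std()
pearson_low, pearson_high = pearson[low].std(), pearson[high].std()
print("raw residual SD     (low mean, high mean):", raw_low, raw_high)
print("Pearson residual SD (low mean, high mean):", pearson_low, pearson_high)

# An ordinary straight-line fit to the same counts can predict negative values.
slope, intercept = np.polyfit(x, y, 1)
ols_fitted = intercept + slope * x
print("smallest straight-line prediction:", ols_fitted.min())
```

In this setup the raw residuals are noticeably more spread out where the mean is large, while the Pearson residuals have roughly the same spread in both groups. That is why a plot of raw residuals from a count model shows a fan shape even when the model is correct.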
