Question

A scatter plot of the data gives r² = 0.0039, so essentially no correlation is shown between exam marks and hours spent studying. The line of best fit, yp = -0.197x + 81, has a negative slope, which suggests that the more you study, the lower your exam mark. What changes could be made to the data set, in general, to account for this error?

Homework Answers

Answer #1

This could be due to outliers in the data. Correlation coefficients are very sensitive to outliers, which may be why the results are inconclusive: with r² = 0.0039, the fitted line explains less than 1% of the variation in exam marks, so the negative slope is not strong evidence that studying lowers marks. You could make the following changes to the data set to account for the error:

1. Check whether any data point was measured or entered incorrectly. This can happen, and such points should be corrected, or removed if they cannot be corrected.

2. Remove the outliers and then recheck the results, because a single outlier can change them significantly. Always state in a footnote that you have removed an outlier.

3. Try a different model, such as an exponential model or a higher-degree polynomial model.
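
The three steps above can be sketched in Python with NumPy. This is only an illustration under stated assumptions: the hours and marks below are made-up values (the original data set is not given), the 1.5 × IQR fence is one common rule of thumb for flagging outliers rather than the only valid test, and the helper names fit_and_r2 and iqr_mask are hypothetical.

```python
import numpy as np

# Hypothetical data: hours studied and exam marks for eight students.
# The last mark (20) stands in for a mis-entered or outlying value.
hours = np.array([1, 2, 3, 4, 5, 6, 7, 8], dtype=float)
marks = np.array([62, 65, 70, 72, 75, 80, 83, 20], dtype=float)


def fit_and_r2(x, y, degree=1):
    """Least-squares polynomial fit; returns (coefficients, r squared)."""
    coeffs = np.polyfit(x, y, degree)
    predicted = np.polyval(coeffs, x)
    ss_res = np.sum((y - predicted) ** 2)   # unexplained variation
    ss_tot = np.sum((y - y.mean()) ** 2)    # total variation
    return coeffs, 1.0 - ss_res / ss_tot


def iqr_mask(values, k=1.5):
    """True for values inside the k * IQR fences (a rule of thumb)."""
    q1, q3 = np.percentile(values, [25, 75])
    spread = q3 - q1
    return (values >= q1 - k * spread) & (values <= q3 + k * spread)


# Fit with every point, including the suspect one.
coeffs_all, r2_all = fit_and_r2(hours, marks)

# Steps 1 and 2: flag and drop the suspect point, then refit.
keep = iqr_mask(marks)
coeffs_clean, r2_clean = fit_and_r2(hours[keep], marks[keep])

# Step 3: try a degree-2 polynomial on the cleaned data for comparison.
coeffs_quad, r2_quad = fit_and_r2(hours[keep], marks[keep], degree=2)

print(f"all points:    slope = {coeffs_all[0]:.3f}, r^2 = {r2_all:.4f}")
print(f"outlier removed: slope = {coeffs_clean[0]:.3f}, r^2 = {r2_clean:.4f}")
print(f"quadratic fit: r^2 = {r2_quad:.4f}")
print(f"points removed: {int((~keep).sum())} (report this in a footnote)")
```

With this made-up data, the single low mark is enough to make the fitted slope negative and r² small, and removing it restores a positive slope and a much higher r². A higher-degree polynomial will never have a lower r² than the straight line on the same points, so a small gain in r² alone is not a reason to prefer the more complex model.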
