I know this might be a long shot, but for a project I need to find data to run a statistical test on, preferably a one-way or two-way ANOVA. I cannot for the life of me find any raw data to analyze. I pretty much just need help finding some. Any help is appreciated!
To get raw data for analysis or for the machine learning projects I work on, I generally use "kaggle.com".
This is a data science website that hosts datasets for many real-life problems, which can be used to build relevant models. Since the coronavirus outbreak has grown into such a large epidemic, why not run your analysis on a coronavirus dataset? Here is a snapshot of the dataset:
country | region | group | infection_reason | infection_order | infected_by | contact_number | confirmed_date | released_date | deceased_date | state
China | filtered at airport | | visit to Wuhan | 1 | | 45 | 20-01-2020 | 06-02-2020 | | released
Korea | filtered at airport | | visit to Wuhan | 1 | | 75 | 24-01-2020 | 05-02-2020 | | released
Korea | capital area | | visit to Wuhan | 1 | | 16 | 26-01-2020 | 12-02-2020 | | released
Mongolia | capital area | | visit to Wuhan | 1 | | 95 | 27-01-2020 | 09-02-2020 | | released
Korea | capital area | | visit to Wuhan | 1 | | 31 | 30-01-2020 | | | isolated
Korea | capital area | | contact with patient | 2 | 3 | 17 | 30-01-2020 | 19-02-2020 | | released
Korea | capital area | | visit to Wuhan | 1 | | 9 | 30-01-2020 | 15-02-2020 | | released
Korea | Jeollabuk-do | | visit to Wuhan | 1 | | 113 | 31-01-2020 | 12-02-2020 | | released
Mongolia | capital area | | contact with patient | 2 | 5 | 2 | 31-01-2020 | 24-02-2020 | | released
Korea | capital area | | contact with patient | 3 | 6 | 43 | 31-01-2020 | 19-02-2020 | | released
Korea | capital area | | contact with patient | 3 | 6 | 0 | 31-01-2020 | 10-02-2020 | | released
Mongolia | capital area | | contact with patient in Japan | 2 | | 422 | 01-02-2020 | 18-02-2020 | | released
Korea | filtered at airport | | residence in Wuhan | 1 | | 0 | 02-02-2020 | 24-02-2020 | | released
Mongolia | capital area | | contact with patient | 3 | 12 | 3 | 02-02-2020 | 18-02-2020 | | released
Korea | capital area | | contact with patient | 2 | 4 | 15 | 02-02-2020 | 24-02-2020 | | released
Korea | Gwangju | | visit to Thailand | 1 | | 450 | 04-02-2020 | 19-02-2020 | | released
With a one-way ANOVA, test whether the mean contact_number is the same across the three countries: China, Mongolia, and Korea.
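As a rough illustration of that test, here is a minimal sketch that computes the one-way ANOVA F statistic by hand for the 16 rows in the snapshot above. The names `rows` and `one_way_anova` are my own, not part of the dataset. Note that the snapshot is tiny and China has only a single row, so treat this as a demonstration of the mechanics rather than a meaningful result; you would want the full dataset for the actual project.

```python
from collections import defaultdict

# (country, contact_number) pairs copied from the 16 snapshot rows above.
rows = [
    ("China", 45), ("Korea", 75), ("Korea", 16), ("Mongolia", 95),
    ("Korea", 31), ("Korea", 17), ("Korea", 9), ("Korea", 113),
    ("Mongolia", 2), ("Korea", 43), ("Korea", 0), ("Mongolia", 422),
    ("Korea", 0), ("Mongolia", 3), ("Korea", 15), ("Korea", 450),
]


def one_way_anova(groups):
    """Return (F, df_between, df_within, ss_between, ss_within).

    `groups` maps a group label to a list of numeric observations.
    """
    all_values = [x for values in groups.values() for x in values]
    n_total = len(all_values)
    grand_mean = sum(all_values) / n_total

    ss_between = 0.0  # variation of group means around the grand mean
    ss_within = 0.0   # variation of observations around their group mean
    for values in groups.values():
        group_mean = sum(values) / len(values)
        ss_between += len(values) * (group_mean - grand_mean) ** 2
        ss_within += sum((x - group_mean) ** 2 for x in values)

    df_between = len(groups) - 1
    df_within = n_total - len(groups)
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within, ss_between, ss_within


# Group the contact_number values by country.
groups = defaultdict(list)
for country, contact_number in rows:
    groups[country].append(contact_number)

f_stat, df_between, df_within, ss_between, ss_within = one_way_anova(groups)
print(f"F({df_between}, {df_within}) = {f_stat:.3f}")
```

To turn the F statistic into a p-value, compare it against the F distribution with the printed degrees of freedom. If SciPy is available, `scipy.stats.f_oneway` takes the per-country lists and returns both the statistic and the p-value in one call.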
Let me know in the comments if anything is unclear or if you need more help, and I will reply ASAP! Please upvote if you are satisfied!