A subset of the original dataset, which is unique to you, has been sent to you by e-mail, and you will need to use this version for the Assignment. In your analytical report you are required to answer all four of the following research questions (Q1, Q2, Q3 and Q4):
Q1: Is there an association between death and gender? (Note: you will need to recode the current status of each Covid-19 patient into one of two groups: alive or deceased.)
Q2: Is there a difference in age between males and females among deceased patients? At the end of your summary, critically reflect on how these results provide important information for answering Question 1. (Note: you will need to create a new variable for age, calculated as 2020 minus the patient's birth year. As the sample size for this analysis is small, you can use a threshold of 20% to assess normality. You can also use a significance level of 0.10 and a 90% confidence level for the statistical results.)
Q3: Are there differences in the number of days from confirmed test to release between the provinces identified in your data?
Q4: Are age and global number (the cumulative number of cases) significant predictors of the number of days from confirmed test to release? First, describe the relationship between each independent variable and the dependent variable, and then identify which of the variables explains the largest amount of variation in the number of days from confirmed test to release. If researchers are mostly interested in the association between global number and the number of days from confirmed test to release, why is the effect of age being examined (provide details relevant to your data)? (Note: use the new age variable you created for Question 2.)
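
The notes above call for three derived variables: a two-group status (alive or deceased), age (2020 minus birth year), and the number of days from confirmed test to release. The Python sketch below shows one possible way to compute them. It is an illustration only: the records are made up, and the field names (`sex`, `birth_year`, `state`, `confirmed_date`, `released_date`) and status labels are assumptions, so replace them with the column names and values in your own subset.

```python
from datetime import date

# Hypothetical example records. The field names and status labels are
# assumptions for illustration; match them to your own subset.
patients = [
    {"sex": "male", "birth_year": 1945, "state": "deceased",
     "confirmed_date": "2020-02-20", "released_date": None},
    {"sex": "female", "birth_year": 1988, "state": "released",
     "confirmed_date": "2020-02-01", "released_date": "2020-02-19"},
    {"sex": "female", "birth_year": None, "state": "isolated",
     "confirmed_date": "2020-03-05", "released_date": None},
]


def derive(patient):
    """Return a copy of the record with the three derived variables added."""
    # Two-group status: every status other than "deceased" counts as alive.
    outcome = "deceased" if patient["state"] == "deceased" else "alive"

    # Age is the difference between 2020 and the birth year (missing stays missing).
    birth_year = patient["birth_year"]
    age = 2020 - birth_year if birth_year is not None else None

    # Days from confirmed test to release, defined only for released patients.
    days = None
    if patient["confirmed_date"] and patient["released_date"]:
        confirmed = date.fromisoformat(patient["confirmed_date"])
        released = date.fromisoformat(patient["released_date"])
        days = (released - confirmed).days

    return {**patient, "outcome": outcome, "age": age, "days_to_release": days}


derived = [derive(p) for p in patients]
for row in derived:
    print(row["sex"], row["outcome"], row["age"], row["days_to_release"])
```

In this sketch, patients with no release date or no birth year keep a missing value rather than being dropped, so you can decide for each question which cases to exclude.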
