Covid-19 Data Analysis in R

  • Category: Data Analysis
  • Client: Test purpose
  • Project date: 25-09-2023
  • Project URL: Click here

Portfolio detail


Objectives:

  • Data Import: The primary objective is to import and describe a COVID-19 dataset.
  • Data Cleaning: Clean the dataset by creating a binary "death_dummy" variable to indicate whether a person has died or not.
  • Statistical Analysis: Analyze the age difference between deceased and surviving individuals and Analyze the impact of gender on the death rate.

Libraries Used:

Hmisc: This library is used for various data manipulation and analysis functions.

Summary:

This R program aims to explore and analyze a COVID-19 dataset. It begins by importing the data and conducting data cleaning by creating a new variable to indicate death status. It then explores the age difference between deceased and surviving individuals and assesses whether this difference is statistically significant. Finally, it investigates the impact of gender on the death rate and determines if there is a statistically significant difference between male and female death rates.Overall, the program provides a preliminary analysis of the dataset, focusing on age and gender as potential factors influencing COVID-19 outcomes