Assignment3.docx

The Best WritersExploratory Data Analysis (EDA) and Linear Regression[Type the document subtitle]Bs3/14/2022Assignment 3Part A:First of all, we need to import the libraries and read the CSV file of COVID Variants and print the first five rows of the dataset which is as follows:Descriptive statistics:Then, we need to analyze the descriptive statistics of data frame. Descriptive statistics give the statistic description of the dataset. We can calculate sum( ) and mean( ) and count( ) etc. of each columns of the dataset. Here, we calculate the sum() of the dataset.Here, we calculate the mean() of the dataset.Here, we find the statistic description of categorical data of the dataset.Here, we find the statistic description of all the attributes of the dataset.Histograms:The histogram of the COVID Variants Dataset contains the plots of numeric data. This plot consists of num_sequences, perc_sequences and num_sequences_total. The histogram of the COVID Variants Dataset is shown below:Bar charts:The bar chart of the COVID Variants Dataset is shown below:Heat maps:The heat map of the COVID Variants Dataset is shown below:Line graphs:The line graph of the COVID Variants Dataset is shown below:Box plots:The box plot of the COVID Variants Dataset is shown below:Frequency tables:Now, this is the frequency table of COVID Variants Dataset. This frequency table consists of the number of variants in the dataset it tells the frequency of each variant.This frequency table consists of the location where variants are found. It tells the frequency of each location.