Urgenthomework logo
UrgentHomeWork
Live chat

Loading..

Math2349: Data Preprocessing- World Assessment Answer

1HABE

R Studio Assignment Help:

Task:

You will use WHO data set for Tasks 1- 5. Read the WHO data using an appropriate function and complete the tasks 1-5.

1-  Tidy Task 1:

Use appropriate “tidyr” functions to reshape the WHO data set into the form given below:

2- Tidy Task 2:

The WHO data set is not in a tidy format yet. The “code” column still contains four different variables’ information (see variable description section for the details). Separate the “code” column and form four new variables using appropriate “tidyr” functions.  The final format of the WHO data set for this task should be in the form given below:

3- Tidy Task 3:

The WHO data set is not in a tidy format yet. The “rel”, “ep”, “sn”, and “sp” keys need to be in their own columns as we will treat each of these as a separate variable. In this step, move the “rel”, “ep”, “sn”, and “sp” keys into their own columns. The final format of the WHO data set for this task should be in the form given below:

4- Tidy Task 4:

There is one more step to tidy the WHO data set. We have two categorical variables “sex” and “age”. Use “mutate()” to factorise sex and age. For “age” variable, you need to create labels and also order the variable. Labels would be: <15, 15-24, 25-34, 35-44, 45-54, 55-64, 65>=. The final tidy version of the WHO data set would look like this:

5- Task 5: Filter & Select

Drop the redundant columns “iso2” and “new”, and filter any three countries from the tidy version of the WHO data set. Name this subset of the data frame as “WHO_subset”.

You will use surveys and species data sets for Tasks 6 – 10. Read the species and surveys data sets using an appropriate function. Name these data frames as “species” and “surveys”, respectively.

6- Task 6: Join 

Combine “surveys” and “species” data frames using the key variable “species_id”. For this task, you need to add the species information (“genus”, “species”, “taxa”) to the “surveys” data. Rename the combined data frame as “surveys_combined”.

7- Task 7: Calculate

Using the “surveys_combined” data frame, calculate the average weight and hindfoot length of one of the species observed in each month (irrespective of the year). Make sure to exclude missing values while calculating the average.

8- Task 8: Missing Values

Select one of the years in the “surveys_combined” dataframe, rename this data set as “surveys_combined_year”. Using “surveys_combined_year” dataframe, find the total missing values in “weight” column grouped by species. Replace the missing values in “weight” column with the mean values of each species. Save this imputed data as “surveys_weight_imputed”.

9- Task 9: Inconsistencies or Special Values

Inspect the “weight” column in “surveys_weight_imputed” dataframe for any further inconsistencies or special values (i.e., NaN, Inf, -Inf). Trace back and explain briefly why you got such a value.

10- Task 10: Outliers

Using the “surveys_combined” data frame, inspect the variable hindfoot length for possible univariate outliers. If you detect any outliers use any of the methods outlined in the Module 6 notes to deal with them. Explain briefly the actions that you take to handle outliers.


Buy Math2349: Data Preprocessing- World Assessment Answer Online


Talk to our expert to get the help with Math2349: Data Preprocessing- World Assessment Answers to complete your assessment on time and boost your grades now

The main aim/motive of the management assignment help services is to get connect with a greater number of students, and effectively help, and support them in getting completing their assignments the students also get find this a wonderful opportunity where they could effectively learn more about their topics, as the experts also have the best team members with them in which all the members effectively support each other to get complete their diploma assignments. They complete the assessments of the students in an appropriate manner and deliver them back to the students before the due date of the assignment so that the students could timely submit this, and can score higher marks. The experts of the assignment help services at urgenthomework.com are so much skilled, capable, talented, and experienced in their field of programming homework help writing assignments, so, for this, they can effectively write the best economics assignment help services.

Get Online Support for Math2349: Data Preprocessing- World Assessment Answer Assignment Help Online

Resources

    • 24 x 7 Availability.
    • Trained and Certified Experts.
    • Deadline Guaranteed.
    • Plagiarism Free.
    • Privacy Guaranteed.
    • Free download.
    • Online help for all project.
    • Homework Help Services
); }
Copyright © 2009-2023 UrgentHomework.com, All right reserved.