[Get it solved] Analyze the summary of your data. What are the mean and m...

Check Out Our Work & Get Yours Done

Submit Work

Download Sample

Enroll in the complete course for only $250 USD*

Order Now

Submit work Offers

Analyze the summary of your data. What are the mean and median average incomes?

computer science

Description

Read the income dataset, “zipIncomeAssignment.csv”, into R. (You can find the csv file in iLearn under the Content -> Week 2 folder.)
Change the column names of your data frame so that zcta becomes zipCode and meanhouseholdincome becomes income.
Analyze the summary of your data. What are the mean and median average incomes?

4)Plot a scatter plot of the data. Although this graph is not too informative, do you see any outlier values? If so, what are they?

5)In order to omit outliers, create a subset of the data so that:

$7,000 < income < $200,000 (or in R syntax , income > 7000 & income < 200000)

6)What’s your new mean?

7)Create a simple box plot of your data. Be sure to add a title and label the axes.

HINT: Take a look at: https://www.tutorialspoint.com/r/r_boxplots.htm (specifically, Creating the Boxplot.) Instead of “mpg ~ cyl”, you want to use “income ~ zipCode”.

In the box plot you created, notice that all of the income data is pushed towards the bottom of the graph because most average incomes tend to be low. Create a new box plot where the y-axis uses a log scale. Be sure to add a title and label the axes. For the next 2 questions, use the ggplotlibrary in R, which enables you to create graphs with several different types of plots layered over each other.

8)Make a ggplot that consists of just a scatter plot using the function geom_point() with position = “jitter” so that the data points are grouped by zip code. Be sure to use ggplot’s function for taking the log10 of the y-axis data. (Hint: for geom_point, have alpha=0.2).

9)Create a new ggplot by adding a box plot layer to your previous graph. To do this, add theggplot function geom_boxplot(). Also, add color to the scatter plot so that data points between different zip codes are different colors. Be sure to label the axes and add a title to the graph. (Hint: for geom_boxplot, have alpha=0.1 and outlier.size=0).

10) What can you conclude from this data analysis/visualization?

Related Questions in computer science category

Assume You Are Assisting With IR Planning For The Wilmington University Library.

Technique(s) or scheme(s) or method(s) for detecting, preventing or mitigating DoS or Distributed DoS (DDoS) attacks

Discuss protocols, what they are, how they work, their importance to the science of cryptography and the four different types

A C++ program has to be written to create a chat server and add a bot to the system. Extensive analysis and reporting are to be done with the program.

Write a C++ program using pointers that will create dynamically allocated array of monthly sales figures whose size has been input by the user. After prompting the user to input the sales figure, it will find the highest monthly sales amount and the lowes

Create a class that can be used to test data structure - similar to the StudentA.java example found shown below: StudentA. java Note that this class will be used in the projects in the rest of this course to test various data structures and algorithms The

client-based architectures

Security awareness mobile application system

Credit Card Debt The True Cost of Paying Minimum Payment Write a C++ program to output the payment schedule for the amount owed when each month nothing more is charged to the account but only the minimum payment is paid. The output stops when the balance

Systems Analysis and Development Group Project Details

Get Higher Grades Now

Tutors Online

Description

Drop Files Here Or Click to Upload

Get Free Quote!

261 Experts Online

Connect With Us

We Provide Services Across The Globe

Disclaimer: The reference papers or solutions provided by Calltutors.com serve as model papers or solutions for students or professionals and are not to be submitted as it is to any institutions. These documents are intended to be used for research and reference purposes only. University and company's logo's are the property of respected owners. We don't have affiliation with the mentioned universities. By using our services means, you agree to our Honor Code , Privacy Policy , Terms & Conditions , Payment , Refund & Cancellation Policy.

Enroll in the complete course for only $250 USD*

Analyze the summary of your data. What are the mean and median average incomes?

computer science

Description

Get instant assignment help service

Related Questions in computer science category

Policy

Exploring

Other

Connect With Us

We Provide Services Across The Globe