This assessment uses data from the United States Social Security Administration (SSA) which contains names of babies born between 1990 and 2010.

computer science

Description

This assessment uses data from the United States Social Security Administration (SSA) which contains names of babies born between 1990 and 2010. The objectives of this assessment are as follows: (1) visualize the total male and female babies over time, (2) tabulate the most and least popular baby names, and (3) analyze trends in names. You are provided with a file “DATA(1990-2010).csv’ which contains information such as name, sex, total number and the year of births. Using this file, your tasks are as follows: Task 1: Create a pivot table of total births by sex and year and then plot them (Figure 1). Task 2: Calculate the total births over the sample period by grouping the data by name and sex. Subset the group into male and female. Using these subsets, select the top and bottom 3 male and female names. In total, you should have 12 names in total. Report them in a single table (Table 1). Task 3: Using the top male and female names (two names in total), check their trends over time, i.e. plot the total births with these name from 1990 to 2010 (Figure 2). In order to do this, you would first need to create a pivot table. After completing the above tasks, you should have two figures and one table in total. Make a written report of these findings. Your Python script is to be attached in the Appendix of the report.


Related Questions in computer science category