## In 1978, William J. Wilson (then at the University of Chicago’s sociology department; now at Harvard, where good professors go to die) published his first major book,

### statistics

##### Description

STAT 1021 – Introduction to Statistics

Spring 2020

Assignment #3

In 1978, William J. Wilson (then at the University of Chicago’s sociology department; now at Harvard, where good professors go to die) published his first major book, The Declining Signifi-cance of Race, which posits that class-based factors have begun to supplant factors related to race per se as the primary causes of unequal social outcomes for Black Americans. In short, Wilson argues that Black folks were formerly disadvantaged simply because they were Black; today, by contrast, Black Americans’ unequal access to education, jobs, quality neighborhoods, and other resources are responsible for persistent disparities in social outcomes (rather than race as such).

We will test Wilson’s hypothesis using data from the oldest (1972) and newest (2018) GSS waves. If Wilson is correct, we expect that in the older data, black respondents will show poorer outcomes than white respondents even after controlling for education and other factors; in the newer data, while blacks may still have poorer outcomes in general, these disparities will shrink or even vanish after controlling for education and other factors.

Tables analyzing the data from the 1972 and 2018 GSS data are presented below. Because Wilson was primarily interested in black-white differences, analyses have been restricted to black and white respondents only (other racial groups provided relatively few observations anyway). The SEI variable was not added to the GSS until 1988, so we will use occupational prestige as our primary outcome. (Assume for the purpose of the assignment that the 1970 and 2010 scales for occupational prestige are directly comparable.)

1.   From the 1972 GSS data:

A.    [10 points] Table 1A shows the results of a naïve linear regression model for the effect of being black on occupational prestige. In a short paragraph, define and interpret the main terms, including r2. In doing so, answer the questions: How much difference is there between blacks and whites? How significant is it? How much of the variation in occupational prestige is explained by race alone?

Table 1A: 1972 GSS linear regression model results

B.     [15 points]  Table 1B shows the results of a more complex model, taking into account years of education (recentered on 12) and age (recentered on 18), as well as interaction effects for race and education and for race and age. In a paragraph, define and interpret all the terms in the model, including r2. In doing so, answer the questions: How much difference is there between blacks and whites, either at baseline (main effect) or in response to changes in age or education (interaction effects)? How significant are the differences? How much of the variation in occupational prestige is explained by this model?

Table 1B: 1972 GSS multiple regression model results

C.  [10 points]  Check the adequacy of the model in (B). Assess the assumptions of:

i.          linear relationship between (continuous) predictors and outcome, from figures below