Practice Problem Set 2: Data Visualization
1. Run the following code to load the data. Glimpse the data and learn about the variables:

For the following questions, please create a graph using ggplot to answer:

2. Create a plot to examine the distribution of the re78 variable, which holds the participants' income in 1978. (Note: if your code spans multiple lines, highlight all of them before running it in the webr interface.)
3. Now check how the distribution of that variable differs between the treatment and control groups.
4. Think of one more way to visualize the difference in the distribution of re78 in treatment versus control. Which way do you prefer, the one from Q3 or the one from Q4, and why?
5. Create graphs showing the distributions of the other variables in the data for the treatment versus control groups.
6. Examine the relationship between age and educ.
7. Does the relationship between age and education differ between the treatment and control groups?
8. Examine the relationship between race and income in 1978.
9. Does the relationship differ by treatment level?
Challenge Problem: Examine the trend in income across the three times given (1974, 1975, 1978). Draw conclusions about the trends.