Practice Problem Set 4: Relational Data
Run the following code to load the data. Look through this website to learn more about the data.
Sketch out the relationships among all the tables in the database. For each dataset, what is the primary key? What are the secondary keys?
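One way to test a candidate primary key is to check that no key value appears more than once. The sketch below is only an illustration on an invented toy table: the table layout, the rows, and the helper name `duplicated_keys` are assumptions, not part of the assignment data or its loading code.

```python
import sqlite3

# Toy stand-in for a planes-like table; all rows are invented for illustration.
con = sqlite3.connect(":memory:")
con.execute("CREATE TABLE planes (tailnum TEXT, manufacturer TEXT, year INTEGER)")
con.executemany(
    "INSERT INTO planes VALUES (?, ?, ?)",
    [("N101", "BOEING", 2004), ("N102", "AIRBUS", 2008), ("N103", "BOEING", 1999)],
)


def duplicated_keys(con, table, key_cols):
    """Return key values that occur more than once (hypothetical helper).

    An empty result means the columns uniquely identify every row.
    """
    cols = ", ".join(key_cols)
    query = (
        f"SELECT {cols}, COUNT(*) FROM {table} "
        f"GROUP BY {cols} HAVING COUNT(*) > 1"
    )
    return con.execute(query).fetchall()


tailnum_dupes = duplicated_keys(con, "planes", ["tailnum"])
manufacturer_dupes = duplicated_keys(con, "planes", ["manufacturer"])
print(tailnum_dupes)       # empty in this toy table, so tailnum is a candidate key
print(manufacturer_dupes)  # a repeated value means manufacturer cannot be a key
```

The same check works for composite keys by passing several column names, which is useful when no single column identifies a row.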
How many flights operated by JetBlue departed from EWR in January 2013?
How many flights that departed from JFK in 2013 were on planes manufactured by Boeing? Create a subset of the flights data that meets these conditions: departed from JFK in 2013 on planes manufactured by Boeing.
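Questions like this one combine two tables: the manufacturer lives in the planes data, while origin and year live in the flights data. A minimal sketch of the join-then-filter pattern follows, using invented toy rows. The `flight_id` column and the spelling 'BOEING' are assumptions made for illustration; the real data may label manufacturers differently.

```python
import sqlite3

con = sqlite3.connect(":memory:")
# Toy tables with invented rows; flight_id is a hypothetical column added
# here so individual rows are easy to tell apart.
con.executescript("""
CREATE TABLE flights (flight_id INTEGER, year INTEGER, origin TEXT, tailnum TEXT);
CREATE TABLE planes (tailnum TEXT, manufacturer TEXT);
INSERT INTO flights VALUES
  (1, 2013, 'JFK', 'N101'),
  (2, 2013, 'JFK', 'N102'),
  (3, 2013, 'EWR', 'N101'),
  (4, 2013, 'JFK', 'N999');
INSERT INTO planes VALUES ('N101', 'BOEING'), ('N102', 'AIRBUS');
""")

# The inner join keeps only flights whose tailnum has a match in planes,
# then the WHERE clause applies conditions drawn from both tables.
boeing_from_jfk = con.execute("""
SELECT f.flight_id, f.origin, f.tailnum, p.manufacturer
FROM flights AS f
JOIN planes AS p ON p.tailnum = f.tailnum
WHERE f.origin = 'JFK' AND f.year = 2013 AND p.manufacturer = 'BOEING'
ORDER BY f.flight_id
""").fetchall()

print(len(boeing_from_jfk))  # number of matching flights
print(boeing_from_jfk)       # the subset itself
```

Note that flight 4 drops out because its tail number has no row in the toy planes table, so the count reflects only flights with known plane information.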
For each of the airports in the flights data, count the number of flights that departed when there was some precipitation.
How many of the flights from Q5 experienced some type of delay? What was the average delay?
Count the delayed flights from Q6 by plane manufacturer and year of manufacture.
Are there any flights in the flights data for which we do not have matches in the other datasets? Why might there be no matches?
How many flights departing from each of the NYC airports were headed to a different time zone? For each airport, calculate the number of departing flights that landed in each time zone. For example, how many flights departed JFK and landed in the America/Chicago time zone, in the America/Los_Angeles time zone, and so on?
Extract data on flights departing NYC airports for which we have information on the plane manufacturer.
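Keeping flights that have a match in another table, and finding flights that have none, are both filtering joins: they select rows from flights without adding columns. A minimal sketch on invented toy rows is shown below; the `flight_id` column and all values are assumptions for illustration only.

```python
import sqlite3

con = sqlite3.connect(":memory:")
# Toy tables with invented rows; flight_id is a hypothetical identifier.
con.executescript("""
CREATE TABLE flights (flight_id INTEGER, tailnum TEXT);
CREATE TABLE planes (tailnum TEXT, manufacturer TEXT);
INSERT INTO flights VALUES (1, 'N101'), (2, 'N102'), (3, 'N999'), (4, NULL);
INSERT INTO planes VALUES ('N101', 'BOEING'), ('N102', 'AIRBUS');
""")

# Semi-join: flights whose tailnum appears in planes.
with_plane_info = [row[0] for row in con.execute("""
SELECT f.flight_id FROM flights AS f
WHERE EXISTS (SELECT 1 FROM planes AS p WHERE p.tailnum = f.tailnum)
ORDER BY f.flight_id
""")]

# Anti-join: flights with no matching row in planes, including
# flights whose tailnum is missing altogether.
without_plane_info = [row[0] for row in con.execute("""
SELECT f.flight_id FROM flights AS f
WHERE NOT EXISTS (SELECT 1 FROM planes AS p WHERE p.tailnum = f.tailnum)
ORDER BY f.flight_id
""")]

print(with_plane_info)
print(without_plane_info)
```

Looking at the unmatched rows, rather than only counting them, often suggests why they have no match, for example a missing tail number versus one that is simply absent from the lookup table.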