The assignment is divided into two parts. You should submit a single Jupyter Notebook and any related scripts or SQL files as a single archive. The notebook should contain a description of your approach as well as any/all processing used to manipulate, cleanse and sanitise the data for purpose. If your dataset exceeds 10MB, then include a working sample of the data that can be used in place of the full dataset. Your project should focus on one of the following themes.