The government provides a data set of MOT outcomes since 2005 at:
https://data.gov.uk/dataset/anonymised_mot_test

The dataset includes many fields including:
Test date, test outcome (inc. failure reason), postcode area, car make, car model, car colour etc.

The aim of this project would be to perform data analysis on this dataset.

Possible directions this project could take based on interest:
- Applying simple statistical models/machine learning techniques to gain some
insight into factors contributing to failure reasons
- How to best visualise large datasets
- How to use a SQL database to improve your data-analysis pipe-line

If there was interest in some of the group continuing to work on this after
the retreat then geostatistical modelling could be an extension of the work
done at the retreat.

 

Suitable for: anyone with an interest in data science