This page contains outdated information from 2012-13 and will be updated in October 2013 with very similar material.
This page contains links to various data files you will need to complete the assignments as well as links to online resources which might be useful for the module.
Notes and clarifications
Here are some of the Mathematica notebooks which I used for the lecture demonstrations:
The files are all in ASCII format with columns separated by spaces or tabs. Generally some detail about the data contained in each file is explained in comment lines at the top of the file. You will probably need to delete these comments to read the data into an analysis programme such as MatLab.
Here are some data files for the projects:
- coffee.dat (data for everyone)
- synthetictimeseries.dat (data for project 1)
- words.dat (data for project 3)
- weblinks.dat (data for project 3)
- fatTails.dat (unknown dataset for project 3) fatTails2.dat (larger sample from the same distribution)
- dicerolls.txt (data for project 1)
- Online Time Series Data Library - contains lots of useful examples from a huge variety of applications.
- M. E. J. Newman, "Power laws, Pareto distributions and Zipf's law." Contemporary Physics 46, 323 (2005).
- A. Broder et al., "Graph structure in the Web." Computer Networks 33, 309 (2000).
- A. Clauset, C.R. Shalizi, and M.E.J. Newman, "Power-law distributions in empirical data" SIAM Review 51(4), 661-703 (2009). (arXiv:0706.1062, doi:10.1137/070710111)