Change point detection

Rachel Oughton

A Time series consists of variables measured sequentially at a set of time points. Often these are modelled as being generated by an ongoing process, such as a physical phenomenon, the economy or a person's health or behaviour. If there is a sudden change to this underlying process, the nature of the time series will change. In Change point detection, we try to indentify if/when, and how, the behaviour of the time series changes. This might be detecting the change from 'awake' to 'asleep' from a smart watch data, determining the start of an earthquake from seismic data, looking at the effect of global events such as Covid-19 or a financial crash on the stock market or detecting changes in a patient's vital statistics.

Example

There are a number of important considerations to be made when detecting changes in behaviour:

In this project, you will study a variety of change point methods and apply them to real and synthetic data. There are many time series data sets available, for example at the Office for National Statistics or yahoo! finance, some of which will work well with change point analysis. There is also a collection of real world datasets chosen specifically as part of a study on change point analysis by the Alan Turing Institute.

There are a number of directions you could take with your project, such as

In this project there will be a lot of data analysis and statistical computation. It is therefore essential to be familiar with the statistical package R, as well as general statistical and data analysis concepts.

Resources

If you'd like to read more about change point detection, here are some good starting points:

The R statistical package can be downloaded from CRAN or RStudio

.

Prerequisites/Corequisites

Statistical Concepts II, Statistical Methods III