Model visualisation

This page lists my published software for model visualisation. This work forms the basis for the third chapter of my thesis.

classifly: Explore classification boundaries in high dimensions.

Given p-dimensional training data containing d groups (the design space), a classification algorithm (classifier) predicts which group new data belongs to. Generally the input to these algorithms is high dimensional, and the boundaries between groups will be high dimensional and perhaps curvilinear or multi-facted. This R package provides methods for visualising the division of space between the groups.

clusterfly: Explore clustering results in high dimensions.

Typically, there is somewhat of a divide between statistics and visualisation software. Statistics software, particularly R, provides implementation of cutting edge research methods, but limited graphics. Visualisation software will provide sophisticated visual interfaces, but few statistical algorithms. The clusterfly package presents some early experimentation aimed at overcoming this deficiency by linking R and GGobi. Cluster analysis was chosen as it is an exploratory method that needs sophisticated visualisation and statistical algorithms.

There are also some custom methods for certain types of clustering, mostly inspired by the work of Dr Dianne Cook:

meifly: Models explored interactively.

Meifly is tool that uses R and GGobi to explore ensembles of linear models, where we look at all possible main effects models for a given dataset (or a large subset of these models). This gives greater insight than looking at any small set of best models alone: an ensemble of many models can tell us more about the underlying data than any individual model alone.


Please make sure you have a current version of R and rggobi installed, then use the following R code: