I have a few posts coming up on Apache Mahout so I thought it might be useful to share some notes. I came at it as primarily an R coder with some very rusty Java and C++ somewhere in the back of my head so that will be my point of reference. I’ve also included … Continue reading
This one came about because I was searching for a data set on horror films (don’t ask) and ended up with one describing the links between philosophers. To cut a long story very short I’ve extracted the information in the influenced by section for every philosopher on Wikipedia and used it to construct a network … Continue reading
I’ve been interested in setting up an Amazon Web Services EC2 instance for a while – essentially a remote desktop in the cloud, which can be handy when you want an always-on machine (say, to run scripts at particular times, or to have easy access to a particular machine setup). Over the next few weeks, … Continue reading
Welcome to Drunks and Lampposts, our new statistics and data related blog. We’re both big fans of the world wide web as a fantastic learning and doing resource: it has never been easy to learn the theory behind a new technique, but then also be able to implement that with your own data and answer … Continue reading