A valuable tool in choice modelling is the Dirichlet-multinomial distribution. It’s a compound of the multinomial and Dirichlet distributions and it works like this: A choice between N options is modelled as a multinomial distribution with parameters θ1, θ2, θ3 … θN, where the thetas also represent the probabilities of each option being chosen. For … Continue reading
This one came about because I was searching for a data set on horror films (don’t ask) and ended up with one describing the links between philosophers. To cut a long story very short I’ve extracted the information in the influenced by section for every philosopher on Wikipedia and used it to construct a network … Continue reading
I’ve been interested in setting up an Amazon Web Services EC2 instance for a while – essentially a remote desktop in the cloud, which can be handy when you want an always-on machine (say, to run scripts at particular times, or to have easy access to a particular machine setup). Over the next few weeks, … Continue reading
Welcome to Drunks and Lampposts, our new statistics and data related blog. We’re both big fans of the world wide web as a fantastic learning and doing resource: it has never been easy to learn the theory behind a new technique, but then also be able to implement that with your own data and answer … Continue reading