Data Links #21

Brief reminder: the weekly Data Links article is the place where I gather all the links involving data science (and some other unrelated topics) that I found interesting during last week. Unless there is some major disruption, you can expect to find it every Monday (time might vary) here.

more ...

Bond movies through data

The new James Bond movie, Spectre, will be out in a few weeks. I thought it would be a very nice exercise to do a quick check on how did previous movies do in terms of overall quality (as assessed by the critics and / or the public) and box office …

more ...



Data Links #18

  • Here you have a new data analysis competition, this time brought to us by DrivenData: Naive Bees Classifier. From the competition description: Metis wants to know: using images from BeeSpotter can you identify a bee as a honey bee or a bumble bee? These bees have different behaviors and appearances …

more ...


Data Links #16

  • Are you a user of the amazing R caret package? Do you sometimes wonder why, when you use svmRadial or svmLinear (or an SVM with any other kernel), the result changes a bit (or a lot) depending on the logical control variable classProbs? Here is your answer, written by no …

more ...


From wide to long format in one line of R

A Spanish journalist friend told me his following problem: he had a dataset of longitudinal data (say, 10 years, one data point per year) for several countries. While gathering the data, he chose a wide structure: a column for the country name and a column for each year of data …

more ...