From wide to long format in one line of R

Tags: R / data

A Spanish journalist friend told me about the following problem: he had a longitudinal dataset (say, 10 years, one data point per year) for several countries. While gathering the data, he chose a wide structure: a column for the country name and a column for each year of data ...

more ...


Data Links #13

  • A new Kaggle competition: Truly native? Dato is sponsoring this competition with the noble goal of making native advertising live up to its name. With a dataset of over 300,000 raw HTML files containing text, links, and downloadable images, they also want to give Kagglers a challenge that encourages ...

more ...


Data Links #12

What is this? Brief reminder: it is a weekly selection of links related to the field of data analysis and some other stuff that may be completely unrelated but interesting in any case.

more ...

Self-hosted git repositories, a quick and dirty guide

Like most other people, I maintain a certain number of projects on GitHub, typically the ones for which I wish to publish the code. For other developments where I need to collaborate with other people but don't want to publish the code (for instance, Kaggle competitions), I use private repositories on Bitbucket. While all these services provide very convenient features, maybe you want to have total control over your git repositories for truly private stuff. If that is the case, keep reading.

more ...


Data Links #10

  • The long-denied Spanish housing bubble burst, leaving rotten concrete carcasses left and right. These guys are trying to catalog each modern ruin in the country.
  • If you want to really learn how neural networks work, the best thing is to implement them yourself. The second best thing is to look ...
more ...