Channel: Just Another Data Blog

A Data Cleaning Example

For this particular example,

  • the variables of interest are stored as key:value pairs, and
  • a single data cell can contain an unknown number of key:value pairs.
Basically, we want to convert the input dataset on the left-hand side (LHS) into the output dataset on the right-hand side (RHS), as illustrated in the graphic below.



The objective is to separate these key:value pairs and store each value in the column named after its key.

The hadleyverse packages make this task a fairly simple one, especially tidyr, stringr and magrittr.
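
As a rough illustration of the idea, here is a minimal sketch of one way to do the reshaping with these packages. The input data frame, its column names (`id`, `info`), and the delimiters (`;` between pairs, `:` between key and value) are all assumptions made up for this example, and the code is not necessarily the approach the original post used.

```r
library(tidyr)
library(stringr)
library(magrittr)

# Hypothetical input: each `info` cell holds an unknown number of key:value pairs.
raw <- data.frame(
  id   = c(1, 2, 3),
  info = c("height:170; weight:65", "weight:80", "age:30;height:182;weight:77"),
  stringsAsFactors = FALSE
)

# Remove stray whitespace so the delimiters split cleanly.
raw$info <- str_replace_all(raw$info, "\\s+", "")

tidy <- raw %>%
  separate_rows(info, sep = ";") %>%                        # one key:value pair per row
  separate(info, into = c("key", "value"), sep = ":") %>%   # split each pair into key and value
  pivot_wider(names_from = key, values_from = value)        # one column per key

print(tidy)
```

Keys that are missing for a given row end up as `NA` in the corresponding column, and all values stay as character strings until they are converted explicitly (for example with `as.numeric`).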
