Introduction
Through this blog I intend to work on some data analysis projects, publish the results here and get feedback from other data experts. I have exposure to R, Python, MATLAB and would be using suitable...
View ArticleKnow India through Visualisations - 1
I'm going to produce just a couple of charts, a teaser of sorts in this post. In the forthcoming posts I'll dig deeper.I was amazed with the existing list of R packages to work with spatial data,...
View ArticleAccessing Open Data Portal (India) using APIs
EDIT: I've wrapped up this code into an R package. You can find more info about it on this blog post and here on GitHub.As I mentioned in my previous blog post, Government of India have started an Open...
View Articledplyr Use Cases: Non-Interactive Mode
The current release of dplyr (v 0.4.1) offers lot more flexibility regarding usage of important verbs in non-interactive mode. In this post, I'm exploring different possible use-cases.group_by_,...
View Articleogdindiar: R package to easily access Open Government Data from India Portal
Following up on my earlier posts on accessing Open Government Data from R, I've wrapped this code into an R package - ogdindiar. It's available on GitHub at https://github.com/steadyfish/ogdindiarIt...
View ArticleSurvival Analysis - 1
I recently was looking for methods to apply to time-to-event data and started exploring Survival Analysis Models. In this post, I'm exploring basic KM estimator. It is a nonparametric estimator of the...
View ArticleSurvival Analysis - 2
In my previous post, I went over basics of survival analysis, that included estimating Kaplan-Meier estimate for a given time-to-event data. In this post, I'm exploring on Cox's proportional hazards...
View ArticleA Data Cleaning Example
For this particular example,the variables of interest are stored as key:value pairs anda single data cell could contain multiple (unknown) number of key:value pairs.Basically, we want to convert input...
View Article