Oct 3, 2021
Covid-19 projections based on Curve Fitting using R
Based on a simplistic model of curve fitting, we are able to derive a projection that coincides with what was reported, "Singapore could...
Oct 30, 2019
Cross-sharing data on data.world
While I mainly host my datasets on my GitHub repository, I have also cross-shared some datasets on data.world as the platform is...
Sep 15, 2019
Installing R kernel in Jupyter
With R and Anaconda installed, we can also use R in Jupyter Notebook. So my previous laptop died and now I have to re-install...
May 7, 2019
Sample size and correlation
A typical question is how much data is considered enough. The answer is that it depends. First and foremost, we need to know what...
Dec 15, 2018
Text frequency analysis - Process - R
This is a tutorial to get the frequency distribution of words used in a chunk of text and is a simpler alternative to a more elaborate...
Jun 16, 2018
Data cleaning - Process - R
Data cleaning is one of the most important tasks in data science but it is unglamorous, underappreciated and under-discussed. These are...
Jun 9, 2018
Creating hypothetical datasets - Process - R
Sometimes you want to get started on analyzing data with the main objective of practising the basics of a certain language. So the focus...
Mar 23, 2018
WiDS 2018 Datathon - Process - R
It's been a while since the closing of the Datathon but I hadn't found the time to write about it. This is my first attempt at...
Dec 24, 2017
Text mining - Process - R
This is Part II of a four-part post. Part I talks about collecting text data from Twitter while Part II discusses analysis on text data...
Nov 22, 2017
Getting started
Over the years, the number of tools (or software) I have to install/use has increased steadily given the different types of tasks I have to...