The process and products of various data science tasks, from data collection, data preparation, and data visualization to basic statistical analysis and modelling. Datasets for practice are available.

Selected as Top 100 Data Science Resources for 2018/2019


February 1, 2020

SingStat (the Singapore Department of Statistics) has made quite a lot of data publicly available, but not in the most analyst-friendly format. For those looking for more Singapore-related data to analyze, we can use the API to call these data tables in...

October 30, 2019

While I mainly host my datasets in my GitHub repository, I have also cross-shared some datasets on another platform, as it is integrated with quite a few other tools. It is also more user-friendly for users who might not want to dabble in GitHub...

April 24, 2019

Various data items, such as channel name, video title, and the number of views, likes, dislikes, and comments, can be retrieved using the YouTube Data API v3. This is a free service; however, there are limits on the number of requests we can make. Sho...

March 10, 2019

The Kaggle API was released a year ago, but I had not tried it until now, mainly because I have trouble finding time to participate in Kaggle competitions. Now I'm looking at this year's WiDS datathon (I participated the previous year and thought it was a gr...
