The process and products of various data science tasks, from data collection, data preparation and data visualization to basic statistical analysis and modelling. Datasets for practice are available.

Selected as one of the Top 100 Data Science Resources for 2018


June 26, 2018

This post replicates the previous post on R, this time using Python. Note, however, that the data randomly generated by R and by Python differ. For most of this exercise, we use prepared hypothetical datasets.

Data cleaning is one of...

June 16, 2018

Data cleaning is one of the most important tasks in data science, but it is unglamorous, underappreciated and under-discussed. Common data cleaning tasks include, but are not limited to:

  • Merging/appending

  • Checking completeness of data

  • ...
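As a rough illustration of the first two tasks listed above, here is a minimal pandas sketch. The tables, column names and values are all hypothetical and are not taken from the post's datasets; it only shows one common way to append, merge and check completeness.

```python
import pandas as pd

# Hypothetical data: two batches of records sharing the same columns.
batch_1 = pd.DataFrame({"id": [1, 2], "score": [80.0, None]})
batch_2 = pd.DataFrame({"id": [3, 4], "score": [75.0, 90.0]})

# Appending: stack the rows of both batches into one table.
scores = pd.concat([batch_1, batch_2], ignore_index=True)

# Merging: attach attributes from a second (hypothetical) table by key.
profiles = pd.DataFrame({"id": [1, 2, 3], "region": ["North", "South", "East"]})
merged = scores.merge(profiles, on="id", how="left")

# Checking completeness: count the missing values in each column.
missing_per_column = merged.isna().sum()
print(missing_per_column)
```

A left merge keeps every row of `scores`, so ids without a matching profile show up as missing values in `region` rather than being dropped silently.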

January 30, 2018

This is Part II of a four-part post. Part I covers scraping data from a website (, in this case), while Part II discusses data cleaning/preparation. Part III outlines the process of presenting the data using Tableau and Part IV delves into insigh...

©2017-2019 by DATA DOUBLE CONFIRM.