Process and product of various data science tasks, from data collection, data preparation and data visualization to basic statistical analysis and modelling. Datasets for practice are available.

Selected as Top 100 Data Science Resources for 2018/2019


July 31, 2020

The following information for over 5000 job openings listed on a government portal for virtual career fairs was scraped: title, company, date the opening was posted, job level, contract type, location, salary, job description, requirements, closing date for application, and url...

July 8, 2020

With current deep learning algorithms, it is easy to create a new (averaged) face from photos of multiple faces. I decided to try this out on the candidates of the various parties running in the General Election in Singapore.


May 3, 2020

It was a joy to support PyData Salamanca! Many thanks to my friend Víctor Vicente Palacios for hosting. We conducted the live stream yesterday and are thankful to have many tuning in live. Here's the link to the video for all interested who would like to catc...

March 26, 2020

By looking at the number of updates published on the Ministry of Health's website, we might get a sense of the severity of the coronavirus situation in Singapore, and of how much effort and how many changes in measures the government introduced over time.

Data i...
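The counting idea above can be sketched in a few lines. This is only an illustration, not the method used in the post: the dates below are made-up placeholders rather than real Ministry of Health data, and `updates_per_month` is a hypothetical helper name.

```python
from collections import Counter
from datetime import date

# Placeholder publication dates for illustration only; NOT real data.
update_dates = [
    date(2020, 1, 23),
    date(2020, 1, 24),
    date(2020, 2, 4),
    date(2020, 2, 4),
    date(2020, 2, 17),
    date(2020, 3, 10),
]


def updates_per_month(dates):
    """Count how many updates were published in each (year, month)."""
    counts = Counter((d.year, d.month) for d in dates)
    # Sort by (year, month) so the result reads chronologically.
    return dict(sorted(counts.items()))


monthly = updates_per_month(update_dates)
for (year, month), n in monthly.items():
    print(f"{year}-{month:02d}: {n} update(s)")
```

Grouping by week or by day instead only requires changing the key built inside `Counter`.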

November 2, 2019

In this webscraping attempt, I want to get data on countries, sites and categories of sites in one table. One challenge I faced was getting the data for the sites to match the countries they are tied to. The sites can be extracted through parsing th...
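One way to keep sites matched to their countries is to remember the most recent country seen while walking through the page, and attach it to every site that follows. The sketch below assumes a hypothetical page layout (an `<h2>` per country followed by `<li>` items carrying a `data-category` attribute) and uses Python's standard-library `html.parser` instead of BeautifulSoup so that it runs without extra installs. The markup, class name and sample values are all assumptions for illustration, not the actual page from the post.

```python
from html.parser import HTMLParser

# Hypothetical markup: each country is an <h2>, followed by a list of its sites.
SAMPLE_HTML = """
<h2>Country A</h2>
<ul>
  <li data-category="Cultural">Site One</li>
  <li data-category="Natural">Site Two</li>
</ul>
<h2>Country B</h2>
<ul>
  <li data-category="Mixed">Site Three</li>
</ul>
"""


class SiteTableParser(HTMLParser):
    """Collect (country, site, category) rows from the assumed layout."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._current_country = None  # last <h2> text seen
        self._in_heading = False
        self._in_item = False
        self._category = ""

    def handle_starttag(self, tag, attrs):
        if tag == "h2":
            self._in_heading = True
        elif tag == "li":
            self._in_item = True
            self._category = dict(attrs).get("data-category", "")

    def handle_endtag(self, tag):
        if tag == "h2":
            self._in_heading = False
        elif tag == "li":
            self._in_item = False

    def handle_data(self, data):
        text = data.strip()
        if not text:
            return
        if self._in_heading:
            # A new country starts; later sites belong to it.
            self._current_country = text
        elif self._in_item:
            self.rows.append((self._current_country, text, self._category))


def parse_sites(html):
    parser = SiteTableParser()
    parser.feed(html)
    parser.close()
    return parser.rows


rows = parse_sites(SAMPLE_HTML)
for row in rows:
    print(row)
```

Each row already carries its country, so the result can be written straight into a single table without a separate matching step.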

October 9, 2018

In addition to BeautifulSoup, selenium is a very useful package for webscraping when it involves repeated user interaction with the website (e.g. clicking to select options from a dropdown list and submitting) to generate a desired output or result. Selenium...

January 17, 2018

This is Part I of a four-part post. Part I talks about scraping data from a website (, in this case) while Part II discusses data cleaning and preparation. Part III outlines the process of presenting the data using Tableau and Part IV delves into insight...
