Jul 6, 2024
Extracting data from tables in PDF [Updated] - Python
The previous post on Extracting data from tables in PDF was many years old and requires an update as it didn't cover a Python package...
May 24, 2024
OneMap API
If you're looking for the latitude and longitude of places in Singapore, you can make use of OneMap API. It is free for public use. Below...
Oct 26, 2020
Handling None and NaN in Pandas - Python
The other day as I was reading in a data from BigQuery into pandas dataframe, I realised the data type for column containing all nulls...
Jun 13, 2020
EuroPython 2020 Presentation - Announcement
For the latest call for proposals to include other timezones from Asia Pacific and Americas, I decided to submit a talk and am glad that...
Jun 26, 2018
Data cleaning - Process - Python
This post is a replicate of the previous post on R but using Python this time round. However, note that there is a difference in data...
Jun 16, 2018
Data cleaning - Process - R
Data cleaning is one of the most important tasks in data science but it is unglamorous, underappreciated and under-discussed. These are...
Jan 30, 2018
Data preparation for scraped data - Process - Excel
This is Part II of a four-part post. Part I talks about scraping data from a website (bookdepository.com, in this case) while Part II...