Data wrangling exercises
- datadoubleconfirm

- 52 minutes ago
- 1 min read
The best way to know whether you enjoy working with data — or to learn how to work with data — is to handle many different forms of messy, real-world datasets, i.e. perform the necessary data cleaning and transformations to derive meaningful insights from them.
This post shares two sample Jupyter notebooks that demonstrate various data transformation steps to achieve clean data for two different data formats:
- JSON output obtained from calling the Developer API of the SingStat Table Builder (Singapore Department of Statistics): https://github.com/hxchua/datadoubleconfirm/blob/master/notebooks/SingStat%20API%202025.ipynb
- Raw CSV file hosted on GitHub: https://github.com/hxchua/datadoubleconfirm/blob/master/notebooks/githubraw.ipynb
The example data relates to Graduates from University First Degree Courses by Type of Course and Sex in Singapore.






Comments