DATA DIVE DAYS

Process and product of various data science tasks— from data collection, data preparation, data visualization, to basic statistical analysis and modelling. Datasets for practice available.

Selected as Top 100 Data Science Resources for 2018

on MastersInDataScience.com

November 2, 2019

In this webscraping attempt, I want to get data on countries, sites and categories of sites in one table. One challenge I faced is to get the data for the sites to correspond/ match with the countries that are tied to them. The sites can be extracted through parsing th...

June 23, 2019

I got to chance upon the emoji python package that allows printing emojis in Python and decided to collect some data relating to emojis listed on the emoji cheat sheet. There were associated terms/ descriptions (i.e. alternative names) for each emoji and they were scra...

April 24, 2019

There are various data items, such as channel name, title of video, and number of views, likes, dislikes, and comments, that can be retrieved from using YouTube Data API v3. This is a free service however there are limitations on the number of requests we can make. Sho...

October 9, 2018

In addition to BeautifulSoup, selenium is a very useful package for webscraping when it involves repeated user interaction with the website (eg. to click to select options from certain dropdown list and submit) to generate a desired output/ result of interest. Selenium...

May 11, 2018

This is Part I of a two-part post. Part I talks about scraping data from SGDI while Part II outlines the process of presenting the data using Tableau.  

The code builds on the one covered in a previous post on how to use Beautifulsoup in Py...

January 30, 2018

This is Part II of a four-part post. Part I talks about scraping data from a website (bookdepository.com, in this case) while Part II discusses data cleaning/ preparation. Part III outlines the process of presenting the data using Tableau and Part IV delves into insigh...

January 17, 2018

This is Part I of a four-part post. Part I talks about scraping data from a website (bookdepository.com, in this case) while Part II discusses data cleaning/ preparation. Part III outlines the process of presenting the data using Tableau and Part IV delves into insight...

December 17, 2017

This is Part I of a four-part post. Part I talks about collecting text data from Twitter while Part II discusses analysis on text data i.e. text mining. Part III outlines the process of presenting the data using Tableau and Part IV delves into insights from the analysi...

Please reload

Recent Posts

December 3, 2019

Please reload

Archive

Please reload

Tags