I keep seeing the claim that 80% of a Data Scientist’s work is data cleansing. I’m not sure whether this number is accurate, but for me it reinforces the need for good-quality data if you want good-quality results. If you’re not familiar with the term Data Science, it’s a set of techniques used to extract meaningful information from data sets using methods from Mathematics and Computer Science, particularly statistics, machine learning, classification and data analytics. You may say that this has been around for over 20 years, and it has, but with...