We get invited to solve some very complex data problems and what we found over the years is, when you take into consideration the exact nature of the data and context it is used in, then you can generate much better results. Generic matching engines are good for 80% of duplicates or matches you’re seeking, the remaining 20% need a lot more effort. (I’m using the convention that duplicates refer to records matched with a single data set and matches refer to records matched across multiple data sets.) For many people finding 80% is good enough and they can live... Read more →