Our hypothesis:
Similar email addresses indicate chances of customers being the same. This hypothesis observed accuracy. It encouraged us to try another hypothesis. Exact phone numbers indicate the chances of two customers being the same. We felt AI confident venturing into one more hypothesis. We observed a high accuracy of customers being the same when the exact street address and house number matched. The stage was set. Confidence was restored. Data warriors had come back. Common sense had prevailed. We needed to respect the first rule of the game.
Do not take any data with the assumption that it is clean.
We compared records by email address, telephone, street address but not by name. Only thinking outside the box came to our rescue. This project shook the core belief system that one needs to attempt solving a problem in an organized manner. Going disruptive was our only way to find potential matches and restore sanity back to the CRMs.