Today we will walk through the process flow for cleansing an address so that it can be used by Master Data Management (MDM).
Informatica AddressDoctor is designed to deal with improperly truncated data, incomplete data, missing address elements, ambiguous names, and many other challenges while minimizing incorrect modifications to address elements. It also uses fully Unicode-enabled character data processing to handle the complexities of international data.
Steps to cleanse an address:
AddressDoctor begins by checking the address as it was entered. If the address is written in non-Latin characters, it is transliterated into Latin script.
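To make the initial check concrete, here is a minimal Python sketch of how non-Latin input could be detected. It is not AddressDoctor code: the function name `has_non_latin_letters` is hypothetical, and it relies only on the standard `unicodedata` module, on the assumption that a letter whose Unicode name does not start with "LATIN" belongs to another script.

```python
import unicodedata


def has_non_latin_letters(text: str) -> bool:
    """Return True if any alphabetic character is outside the Latin script.

    Hypothetical helper for illustration only; AddressDoctor's own
    detection logic is not documented here.
    """
    for ch in text:
        if ch.isalpha():
            # The second argument is a default for characters without a name.
            name = unicodedata.name(ch, "")
            if not name.startswith("LATIN"):
                return True
    return False


cyrillic = "Москва, Тверская улица 1"
german = "Königstraße 5, München"
print(has_non_latin_letters(cyrillic))
print(has_non_latin_letters(german))
```

Accented Latin letters such as "ö" and "ß" still count as Latin here, so only input in scripts such as Cyrillic or Greek would be routed to the conversion step.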
In the next step, AddressDoctor applies its logic and algorithms to classify and interpret the data elements. Each address element is then stored in a suitable field so that it can be used in later steps.
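As a rough illustration of storing address elements in separate fields, the sketch below parses one simple single-line layout ("house number street, city, postal code") with a regular expression. The `ParsedAddress` class, the `parse_address` function, and the assumed layout are all hypothetical; the product's actual parsing handles far more variation than this.

```python
import re
from dataclasses import dataclass
from typing import Optional


@dataclass
class ParsedAddress:
    house_number: str
    street: str
    city: str
    postal_code: str


# Assumed layout: "<house number> <street>, <city>, <postal code>"
_PATTERN = re.compile(
    r"^\s*(?P<house_number>\d+\w*)\s+(?P<street>[^,]+?)\s*,\s*"
    r"(?P<city>[^,]+?)\s*,\s*(?P<postal_code>[\w -]+?)\s*$"
)


def parse_address(line: str) -> Optional[ParsedAddress]:
    """Split a single-line address into fields, or return None if it does not fit."""
    match = _PATTERN.match(line)
    if match is None:
        return None
    return ParsedAddress(**match.groupdict())


print(parse_address("12 Main Street, Springfield, 62701"))
```

Returning `None` for input that does not fit the layout, rather than guessing, keeps unparsed addresses visible for later handling.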
After this, each element of the address is compared against a reference data set. The address is validated if all of its elements are accurate. If, however, the address as a whole does not make sense but each individual element matches the reference data, AddressDoctor attempts to correct the errors.
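The distinction between a valid address and one whose elements are individually known but inconsistent can be sketched as follows. The `REFERENCE` dictionary, its contents, the status strings, and the `check_address` function are made up for illustration; real reference data is far larger and is organized differently.

```python
# Toy reference data keyed by postal code (illustrative values only).
REFERENCE = {
    "62701": {"city": "Springfield", "streets": {"Main Street", "Oak Avenue"}},
    "60601": {"city": "Chicago", "streets": {"Lake Street"}},
}


def check_address(street: str, city: str, postal_code: str) -> str:
    """Classify an address as 'valid', 'needs_correction', or 'unverified'."""
    entry = REFERENCE.get(postal_code)
    # Valid: all elements agree with one reference entry.
    if entry and entry["city"] == city and street in entry["streets"]:
        return "valid"

    known_cities = {e["city"] for e in REFERENCE.values()}
    known_streets = set().union(*(e["streets"] for e in REFERENCE.values()))
    # Each element exists somewhere in the reference data,
    # but the combination does not form a real address.
    if postal_code in REFERENCE and city in known_cities and street in known_streets:
        return "needs_correction"
    return "unverified"


print(check_address("Main Street", "Springfield", "62701"))
print(check_address("Lake Street", "Springfield", "62701"))
```

In this toy data, "Lake Street" is a known street and "Springfield" a known city, but they do not belong together, which is the case where a correction would be attempted.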
Informatica AddressDoctor uses fuzzy matching and heuristics to analyze the data elements. These help it predict the best possible correction while taking care not to create false positives. Informatica AddressDoctor then formats the resulting clean address according to international postal standards.
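The idea of fuzzy matching with a guard against false positives can be sketched with Python's standard `difflib` module. This is a stand-in technique, not the algorithm AddressDoctor uses; the `suggest_street` function and the `0.85` cutoff are assumptions chosen for the example.

```python
import difflib
from typing import List, Optional


def suggest_street(
    candidate: str, known_streets: List[str], cutoff: float = 0.85
) -> Optional[str]:
    """Return the closest known street, or None if nothing is similar enough.

    A high cutoff means a correction is proposed only for near matches,
    which reduces the risk of replacing a name with the wrong one.
    """
    matches = difflib.get_close_matches(candidate, known_streets, n=1, cutoff=cutoff)
    return matches[0] if matches else None


streets = ["Main Street", "Oak Avenue"]
print(suggest_street("Main Stret", streets))
print(suggest_street("Elm Road", streets))
```

The comparison is case-sensitive, so a fuller version would normalize case and whitespace first. Raising the cutoff trades fewer corrections for fewer wrong ones.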
Users can also create custom address formats by altering the standard address formats. This is done by modifying the Informatica AddressDoctor configuration file and setting the values of particular fields in it.