Glossary
Conversion:
In order for the work to be feasible and reliable the data must be entered into separate fields and if it is not, the conversion process performs the separation.
For example, the conversion process separates the street name from the civic number and the locality name.
Using a similar algorithm, it also separates the last name from the first name using a matrix of proper names and special words to separate the data by comparing certain key words. The gender code is also assigned during this phase to personalize the contact in accordance with the gender assigned (F=Female, M=Male, A=Society, 0=No gender).
Separating the last name from the first name also prepares the groundwork for the deduplication phase.
During the conversion of data from separable fields into separated fields, an additional descriptive "care of" field is extrapolated if recognized. This is of prime importance if it exists for the message to be delivered.
Standardization:
The standardization process validates the method of writing the address in question, using an engine to compare the method of writing the address during input with the methods existing in the Abbreviation/Zip Code/Locality/Street/Civic Number matrices.
If an address is not anomalous, we can say for certainty that if it has been recognized by an automatic tool it will certainly be understood by the individual making the delivery.
The correct interpretation for classifying anomalies is that these are addresses for which there exists some doubt that they are deliverable or, at a minimum, they may not meet the official delivery conditions required by the postal service. This is not to say that they are not deliverable, but they should be looked at critically.
Deduplication:
There are at least two benefits to be gained from cleansing a file of duplicate information.
The first is the savings in costs, while the second is improving your image by not sending identical messages to the same person.
Duplicate data aggregation involves running an algorithm to group two records with the same first and last name and the same deliverable address together with the same code (for example, Personal code or Family code).
Deduplication involves making the best interpretation of the various methods of writing in view of the different mistakes that can be made when writing addresses.
Since the classification process in the previous phase has already recognized the locality and/or street in the address, the deduplication process works using codes instead of descriptions.
Interpretation of the writing methods was already performed when this data was checked for deliverability.
Postal automation:
Postal automation involves preparing a data file so that messages can be sent through the mail system. The process verifies consistency between the zip codes and province abbreviations, and sorts the generic zip codes in cities with multiple zip codes. The process also subdivides and sorts the file in accordance with the Printed Materials Guide of the Italian Postal system, and defines the binding for mailing wrappers and the file layout for printing the wrappers for each mailing. The postal automation process also includes preparing the file layout to print the sheets pallets per mail line and providing statistical summaries.