Data cleansing and standardization improve data quality and facilitate subsequent data processing. The DMExpress Translate and ToUpper functions can be used to standardize data by removing spaces and punctuation and by converting alphabetic characters to a single case.
The attached example applies both functions to the values in a State column, as shown in the following input and output:
To standardize the values in the State column, the Translate function removes periods and spaces, the ToUpper function converts alphabetic characters to upper case, and the final result is stored in a value as follows:
StandardizeState = ToUpper(Translate(RL_CityState.state, '. ', ''))
These standardized values are then used in the Reformat Target Layout and written to the Standardized State column for output.
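For readers who want to check the expected result outside DMExpress, the same transformation can be sketched in Python. This is only an illustrative analogue, not DMExpress code: the function name standardize_state and the sample values are hypothetical, and the sketch assumes that the Translate call above simply deletes every period and space before ToUpper is applied.

```python
def standardize_state(state: str) -> str:
    """Remove periods and spaces, then convert to upper case.

    Hypothetical Python analogue of
    ToUpper(Translate(RL_CityState.state, '. ', '')).
    """
    # Build a translation table that deletes '.' and ' ' (third argument).
    remove_table = str.maketrans("", "", ". ")
    return state.translate(remove_table).upper()


# Hypothetical sample values, not taken from the attached example.
samples = ["n.y.", "N. J.", "ca", " Tx "]
standardized = [standardize_state(s) for s in samples]
print(standardized)
```

Deleting the unwanted characters before changing case keeps the two steps independent, which mirrors the nesting order of the DMExpress expression.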
160_DataStandardizeStates.zip, compatible with DMExpress version 7.4.0 or higher
For additional information on DMExpress functions, see DMExpress functions reference in the DMExpress Help.
Copyright © 2016 Syncsort. All rights reserved.