Normalize whitespaces
The Normalize whitespaces step removes tabs, newline and carriage return characters, and extra spaces from the input text. Between words, each run of these characters is replaced with a single space character; before the first word and after the last word, they are removed completely. Use this step to clean up characters that were entered in error or that are in an unwanted format.
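The behavior described above can be approximated outside the product with a short Python sketch. This is not the step's actual implementation. The function name `normalize_whitespaces` is hypothetical, and the set of characters treated as whitespace (space, tab, carriage return, newline) is an assumption taken from the description.

```python
import re

# Characters named in the description: space, tab, carriage return, newline.
# Assumption: no other Unicode whitespace is affected.
_WHITESPACE_RUN = re.compile(r"[ \t\r\n]+")


def normalize_whitespaces(text: str) -> str:
    """Collapse each whitespace run to one space, then trim both ends."""
    collapsed = _WHITESPACE_RUN.sub(" ", text)
    return collapsed.strip(" ")


if __name__ == "__main__":
    sample = "  Text\twith tabs and\r\na   new line. "
    print(repr(normalize_whitespaces(sample)))
```

Collapsing first and trimming second means leading or trailing runs shrink to a single space before they are stripped, so the two rules from the description do not need separate patterns.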
Parameters
- Input column: required. A string column containing the input text.
- Target column: required. The column that receives the output. The output is always of string type.
  - Write result to the current column: overwrites the Input column with the result.
  - Create new column with name: creates a new column with the specified name. The name can contain spaces or special characters; a technical column name is created automatically. The new column is placed right after the Input column.
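The two Target column options can be illustrated with a minimal Python sketch over rows stored as dictionaries. The names `normalize` and `apply_step` are hypothetical and not part of the product. The sketch assumes that an empty value is represented as `None` and passes through unchanged, and that the new column name is not already in use. It does not model the automatic technical column name.

```python
import re
from typing import Optional


def normalize(text: Optional[str]) -> Optional[str]:
    # Assumption: an empty value (None here) passes through unchanged.
    if text is None:
        return None
    return re.sub(r"[ \t\r\n]+", " ", text).strip(" ")


def apply_step(rows, input_column, new_column=None):
    """Return new rows; overwrite input_column unless new_column is given."""
    result = []
    for row in rows:
        value = normalize(row[input_column])
        if new_column is None:
            # "Write result to the current column"
            result.append({**row, input_column: value})
        else:
            # "Create new column with name": insert right after the input column
            out = {}
            for key, cell in row.items():
                out[key] = cell
                if key == input_column:
                    out[new_column] = value
            result.append(out)
    return result


if __name__ == "__main__":
    rows = [{"id": 1, "name": "  Ada \t Lovelace ", "city": "London"}]
    print(apply_step(rows, "name"))
    print(apply_step(rows, "name", new_column="name clean"))
```

The sketch relies on Python dictionaries preserving insertion order, which is how the new column ends up directly after the Input column.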
Examples
Input value | Output value | Explanation |
---|---|---|
"Text with single spaces." | "Text with single spaces." | No change because there are no extra whitespace characters. |
"Text with lots of space characters." | "Text with lots of space characters." | All extra spaces are removed. |
"Text with tabs and a new line." | "Text with tabs and a new line." | All extra spaces, tabs, and new line/carriage return characters are removed. |
" Test word " | "Test word" | Extra spaces at the beginning and end are removed. |
No value | No value | Empty value produces empty value on output. |
Error | Error | Applying the step to an error results in another Error. |