Cleanse
The Cleanse
action can help eliminate some of the impurities within the dataset to allow better post-processing. For instance, due to human error, there are leading spaces in some cells within a column, "Female" vs. " Female", causing the data to arbitrarily inflate in unique categories. Or in another example, some values are "nulls" within a numeric field, forcing the field to become a string field and lose all the numeric features.
Acho currently supports cleaning action at the column level, and the available cleaning actions are:
cut string
from left
from right
remove a specific character
padding with a rule
from left
from right
uppercase/lowercase standardization
capitalize all characters
capitalize the first character
decapitalize all character
trim leading zeros
trim spaces
from both left and right
from left
from right
replacing string nulls with blanks
replacing numeric nulls with zero
Last updated