Applies a set of find/replace rules to the incoming data
This step works similar to most find/replace functions out there. The base functionality is that it looks for whatever you typed in, and allows you to replace it when found. That can be applied to all columns, or one specifically.
By default, the step ignores casing, but you can enforce it by clicking the icon in the Find field.
You'll notice that you can create as many Find Replace rules as you'd like in the step - each represented by a card.
The rules are applied to the entire dataset/column in top-down order. So the first rule will be applied to every row, and the replacements will be made, and then the next rule will be applied.
This means that if you replace a value with something that will be picked up by a subsequent rule, additional updates will be made to that field.
Regular expressions are extremely powerful when processing data. They allow you to express a pattern that can be matched in the data. They can also be frustrating to work with, and time-consuming to construct.
First, to enable the use of RegEx (in either just the Find field or both the Find and the Replace fields), click the asterisk * icon in the Replace field to turn it blue.
Reference this Regular Expression build tool and cheat sheet for supported RegEx formats and help making an expression that will work.
The Find field is where you will put the pattern to match. If you are using capture groups, then you can use syntax in the replace field to reference those groups. $& will replace with all capture groups, while $1, $2 will replace with capture group 1, capture group 2, and so on.