Filtering rows
The Filter rows step allows you to cut down your dataset by keeping or removing rows based on specific conditions.
Building challenge
- Remove any rows from the sales file where units are equal to zero, or the status is equal to Cancelled or Refunded.
Using the Filter Rows Step in Parabola
Along with Edit Columns, the Filter Rows step is one of the most commonly used steps in Parabola. It allows you to specify which rows to keep or remove based on conditions applied to your column values.
How the Filter Rows Step Works
You can filter data based on various conditions, such as:
- Filtering out blank values
- Checking for exact matches (e.g., Status = "Refunded")
- Looking for values that contain a specific keyword
- Applying numerical filters (e.g., greater than, less than, equal to)
Example: Filtering Order Data
Let’s say we have order data and want to remove:
- Orders where Status = Refunded or Cancelled
- Orders where Units Purchased = 0
To set this up in Filter Rows:
- Remove rows where Status = Refunded or Cancelled
- Add another rule to remove rows where Units Purchased = 0
After applying the filter, we can check how many rows were removed:
- Initial dataset: 64 rows
- After filtering: 57 rows
- Total rows removed: 7
Pro Tip: Checking Your Work
- Click on Input 1 to view the original dataset.
- Compare it with the updated results to confirm your filters worked as expected.
Next Step: Formatting Dates
Now that we’ve cleaned up our columns and rows, we can shift focus to date formatting using the Format Date step. Let’s dive in! 🚀