Transforming data

Filtering rows

The Filter rows step allows you to cut down your dataset by keeping or removing rows based on specific conditions. 

Building challenge

  • Remove any rows from the sales file where units are equal to zero, or the status is equal to Cancelled or Refunded.

Check your work

Image 1 Image 2

Next lesson

Using the Filter Rows Step in Parabola

Along with Edit Columns, the Filter Rows step is one of the most commonly used steps in Parabola. It allows you to specify which rows to keep or remove based on conditions applied to your column values.

How the Filter Rows Step Works

You can filter data based on various conditions, such as:

  • Filtering out blank values
  • Checking for exact matches (e.g., Status = "Refunded")
  • Looking for values that contain a specific keyword
  • Applying numerical filters (e.g., greater than, less than, equal to)

Example: Filtering Order Data

Let’s say we have order data and want to remove:

  1. Orders where Status = Refunded or Cancelled
  2. Orders where Units Purchased = 0

To set this up in Filter Rows:

  • Remove rows where Status = Refunded or Cancelled
  • Add another rule to remove rows where Units Purchased = 0

After applying the filter, we can check how many rows were removed:

  • Initial dataset: 64 rows
  • After filtering: 57 rows
  • Total rows removed: 7

Pro Tip: Checking Your Work

  • Click on Input 1 to view the original dataset.
  • Compare it with the updated results to confirm your filters worked as expected.

Next Step: Formatting Dates

Now that we’ve cleaned up our columns and rows, we can shift focus to date formatting using the Format Date step. Let’s dive in! 🚀