In this blog post, I work through an example of analyzing blanks in a target dataset. When analyzing data, a typical first step is to find out where values are missing. Knowing which fields contain missing values helps you make more informed decisions about your analysis approach.
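
To make the idea concrete, here is a minimal sketch of a blank analysis in plain Python. It is only an illustration, not the method used in the worked example: the sample data, the column names, and the `count_blanks` helper are all hypothetical, and the sketch assumes that a cell counts as blank when it is empty or contains only whitespace.

```python
import csv
import io

# Hypothetical sample data. Empty and whitespace-only cells are treated as blank.
SAMPLE_CSV = """id,name,email,city
1,Ann,ann@example.com,Leeds
2,Bob,,York
3,,cara@example.com,
4,Dan,   ,Bath
"""


def count_blanks(rows):
    """Return {column: number of blank cells} for a list of dict rows."""
    counts = {}
    for row in rows:
        for column, value in row.items():
            # Assumption: None, "" and whitespace-only strings all count as blank.
            is_blank = value is None or value.strip() == ""
            counts[column] = counts.get(column, 0) + (1 if is_blank else 0)
    return counts


rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
blank_counts = count_blanks(rows)

for column, n in blank_counts.items():
    print(f"{column}: {n} blank of {len(rows)} rows")
```

A per-column count like this is usually enough to decide whether a field can be used as is, needs a default value, or should be left out of the analysis.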
This blog post demonstrates how to identify and remove duplicate records from a dataset. I provide a worked example that shows how to configure and apply the deduplicate function to some sample customer data. The deduplicate function is a critical action that allows the workflow developer to create rich data validation and transformation rules.
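
For readers who want to see the underlying idea outside of any particular tool, here is a minimal sketch of key-based deduplication in plain Python. It is not the deduplicate function described in this post: the `deduplicate` helper, its `key_fields` parameter, and the customer records are hypothetical, and the sketch assumes that two records are duplicates when their key fields match after trimming whitespace and ignoring case.

```python
def deduplicate(records, key_fields):
    """Keep the first record seen for each key; return (unique, duplicates)."""
    seen = set()
    unique, duplicates = [], []
    for record in records:
        # Assumption: normalise case and surrounding whitespace so that
        # trivially different entries are treated as the same key.
        key = tuple(str(record[field]).strip().lower() for field in key_fields)
        if key in seen:
            duplicates.append(record)
        else:
            seen.add(key)
            unique.append(record)
    return unique, duplicates


# Hypothetical customer data: record 3 repeats record 1 with different casing.
customers = [
    {"id": 1, "name": "Ann Lee", "email": "ann@example.com"},
    {"id": 2, "name": "Bob Ray", "email": "bob@example.com"},
    {"id": 3, "name": "ann lee", "email": "ANN@example.com "},
]

unique, duplicates = deduplicate(customers, key_fields=["name", "email"])

print("kept:", [c["id"] for c in unique])
print("dropped:", [c["id"] for c in duplicates])
```

Returning the dropped records alongside the kept ones makes it easy to review what was removed before committing to the cleaned dataset.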