Dedupe

<< Click to Display Table of Contents >>

Navigation:  Reference > Transforms >

Dedupe

Description

Remove duplicate rows.

 

Example

If you are cleaning up a mailing list, you might want to dedupe on the 'email' column, after converting all the emails to lower case.

 

Inputs

One.

 

Options

Check the column(s) you wish to look for duplicate values in.

 

Notes

Rows are considered duplicates if they have exactly the same value in all the checked columns.

Comparisons are case and whitespace sensitive. You can use Case to change the case and Whitespace to remove whitespace before deduping.

When several rows are duplicates, only the top one is retained.

The Unique transform is a more powerful (but more complex and slower) alternative to Dedupe.

 

See also

Dedupe a dataset