Easy Data Transform v1.26.0
For Mac and Windows

This release adds improvements and bug fixes to v1.25.0. The download links are at the bottom of the page.

We recommend that you back-up important transform files before installing any new release. Please let us know of any problems or issues you encounter with the release.

You can uninstall old releases after you have successfully installed this version.

Windows and Mac

1. There is now a Characters tab in the Right pane. You can use this to see a profile of which types of characters occur in which columns.

Data profile

It checks for:

Note that some of these categories are non-exclusive. For example: a space might be counted as a space, whitespace and a leading space, and a symbol might also be a non-ASCII character.

Hover over a cell for more details.

Data profile details

You can turn the colored bars off.

Profile data graphics off

And you can restrict it to sampling only a subset of rows for improved speed in large datasets.

Data profiling sample

The Characters tab can be used in conjunction with the Replace and Whitespace transforms to cleanse a dirty dataset.

2. The Column Values window has been improved. You can display this window by double clicking on a column header or clicking the button.

Column values profile

You can now:

3. The Row Values window has been improved. You can display this window by double clicking on a row header or clicking the button.

Row values

You can now change the row without closing the window.

4. The Dedupe transform now supports ‘fuzzy’ matching. This allows you to remove duplicates that are similar, but not identical.

fuzzy dedupe

For example, doing a fuzzy match to this value:

100 avenue street, townsville, ohio

Gives:

Value Fuzzy match
100 avenue street, townsville, ohio 100%
100 avnue street, townsville, ohio 98%
100 avenue street townsville ohio 95%
100 avenue st., townsville, ohio 89%
100 avenue st, townsville 72%
100 av. st., citysville, texas 52%
townsville, ohio 46%
742 evergreen terrace, springfield, oregon 36%


Note that fuzzy matching is significantly slower than exact matching.

5. The Dedupe transform now has options to ignore whitespace and ignore punctuation. Thanks to Geoff H. for the suggestion.

fuzzy dedupe ignore space

6. The Dedupe transform now has a Mode option.

fuzzy dedupe mode

The options are:

7. You can now click the Explore Duplicates… button in the Right pane of the Dedupe transform to see duplicates found with various options. Press the Run button once you have set the options.

Explore duplicates

In the Data tab, duplicate rows are colored pink and arranged under the rows they are duplicates of. The following 5 additional columns are added at the end:

A summary of the results is shown in the Summary tab.

Dedupe summary

Click OK to keep any change you have made in the settings.

Thanks to Geoff H. for suggestions related to improving de-duplication.

8. In the Stack transform you can now check add separator rows to add separator rows between each input. The With value value is added in each column of the separator rows. Thanks to Chris K. for the suggestion.

Stack separator

9. In the New Rows transform you can now set As to User defined row(s) to add user defined rows. For example to add separator rows.

10. The Replace transform now had additional match options for Currency and Non-ASCII.

Replace match options

11. You can now toggle a column’s checked/unchecked state using the space bar.

Press space to toggle

12. The Batch Process window user interface has now been improved so that it can better handle larger numbers of input and output files. Thanks to Marcus A. for reporting this.

13. Memory usage has been improved in batch processing.

14. The default Replace transform is now faster.

15. The Dedupe transform now defaults to having no columns selected.

16. The Mode numeric value output in Summary is now limited to 5 values. Previously the number was unlimited.

17. There was a bug in v1.25.0 where a column set in the Sort transform to sort Ascending could be reset to sort Descending. This is now fixed. Thanks to Frank S. for reporting this.

Downloads

Windows Logo Windows Download

v1.26.0 for Windows 11/10/8/7 (41 MB)

Apple Logo Mac Download

v1.26.0 for Mac 12.x to 10.13 (42 MB)

Questions or problems?

Email support@easydatatransform.com