Sometimes you need to cluster or classify text. For example, you might have thousands of items in an e-commerce catalogue and want to cluster them in certain categories. Or you might have thousands of search terms in a log file and want to group them into common themes for search engine optimization or a pay-per-click campaign. You can easily and quickly do this using Easy Data Transform.
Continue this process iteratively, looking at the results and adding more terms, until you are happy.
Alternatively set Clustering to Automatic and let Easy Data Transform try to find the cluster terms.
Change the Closeness, Max clusters and Max time to fine tune the results.
If you set Clustering to Guided you can provide Terms to give the automatic clustering a head start.
Once you are happy with the results you can output them to a file. With the Cluster item selected click on the To File button in the Left pane (you may need to scroll down to see it). Choose the output file location and format. An output item will be added and the new file will be created. No need to ‘run’ anything.
If you need to clean your data before clustering see how to clean data.
For more information on Cluster options see the Cluster documentation.
As well as clustering/classifying data, Easy Data Transform also allows you to combine 65 transforms in many other ways to create complex data transformations step-by-step for numerical, text and date data.
Easy Data Transform can process millions of rows, and input and output in multiple formats. If you need to normalize data in lots of files, you can do it in a single operation using the batch processing feature.
v1.45.0 for Windows 11 / 10 / 8 / 7 (47 MB)
v1.45.0 for Mac 14.x to 10.13 (79 MB)
Questions or problems?