<< Click to Display Table of Contents >> Navigation: Reference > Transforms > Ngram |
Counts the number of times each sequence of consecutive words appear in the selected column.
Find ngrams of 2 to 3 word length (bigrams and trigrams) in the 'Keywords' column.
One.
•Select the Column you wish to analyze for ngrams.
•Set Minimum N to the minimum number of words in an ngram.
•Set Maximum N to the maximum number of words in an ngram.
•Uncheck case sensitive to convert everything to lower case before counting ngrams.
•Words are made up of letters, digits and apostrophes ('). All other characters are treated as word separators.
•All letters are converted to lower case.
•The output sorted by number of words in the ngram, then the count and then the ngram. Use a Sort transform to sort it in a different order.
•Use a Replace transform before the Ngram transform to remove/replace any words or letters you don't wish to analyze.