Collocation Page Help
This page allows you to make a list of words statistically associated with the word/s (node) in your query.
The Attribute menu uses word as a default, but you can also find the:
- tag: the part of speech tag
- lempos: the lemma conjoined by a hyphen with a shortened form for the part of speech e.g. n for noun
- lemma: the stemmed form of the word
- lc: word in lowercase
- lemma_lc: the stemmed form of the word in lower case
You can specify the range (span of text) around your node word when considering candidates. The defaults are -5 (5 tokens before the node) and 5 (5 tokens after the node).
You can specify thresholds on
- the frequency of the candidate in the corpus
- the frequency of the candidate within the range
You can specify which statistics are displayed, and the statistic to sort by. For information on the statistics available see ske-stat.pdf
Note that You can save these options before making your list
