wiki:SkE/Help/PageSpecificHelp/collocconc

Collocation Page Help

This page allows you to make a list of words statistically associated with the word/s (node) in your query.

The Attribute menu uses word as a default, but you can also find the:

  • tag: the part of speech tag
  • lempos: the lemma conjoined by a hyphen with a shortened form for the part of speech e.g. n for noun
  • lemma: the stemmed form of the word
  • lc: word in lowercase
  • lemma_lc: the stemmed form of the word in lower case

You can specify the range (span of text) around your node word when considering candidates. The defaults are -5 (5 tokens before the node) and 5 (5 tokens after the node).

You can specify thresholds on

  • the frequency of the candidate in the corpus
  • the frequency of the candidate within the range

You can specify which statistics are displayed, and the statistic to sort by. For information on the statistics available see  ske-stat.pdf

Note that You can save these options before making your list