wiki:SkE/Help/PageSpecificHelp/Thesaurus

Thesaurus Entry

In the "Thesaurus Entry Form" you enter

  • a corpus
  • a lemma (base or stem form of a word) and
  • the part of speech of that lemma (e.g. noun)

click Show similar words to see a "distributional thesaurus" which consists of a ranked list of the lemmas most similar to the lemma entered in terms of grammatical and collocational behaviour. Please note that a distributional thesaurus is an automatically produced "thesaurus" which finds words that tend to occur in similar contexts as the target word. It is not a manually constructed thesaurus of synonyms.

Advanced Options

On the main panel you can specify

  • the maximum number of ranked lemmas
  • whether to cluster items (see also Clustering Neighbours documentation)
  • a threshold on the minimum similarity between lemmas in the same cluster

NB that you can save these options