Preparing a Corpus for the Sketch Engine: Overview

To prepare a corpus for the Sketch Engine, we must

(Here we assume a running Sketch Engine installation.)

This will give us a corpus that can be queried to produce a range of concordances and lists. If, in addition, word sketches are required, we must also:

  • Prepare a grammatical relations definitions (gramrels) file: see SkE/CorpusQuerying
  • Run the mkws.sh script.

Running mkws.sh also prepares the thesaurus, which requires no additional inputs: it takes the word sketch database as its input.
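
The word sketch step could be wrapped in a small pre-flight check. This is only a minimal sketch under stated assumptions: the helper name plan_word_sketch_build is hypothetical, and the form of the mkws.sh call (the corpus name as its only argument) is an assumption that should be checked against your installation. The function only verifies that the gramrels file exists and returns the command it would run; it does not execute anything.

```python
from pathlib import Path
from typing import List


def plan_word_sketch_build(corpus_name: str, gramrels_path: str,
                           script: str = "mkws.sh") -> List[str]:
    """Check that the gramrels file exists and return the command to run.

    Hypothetical helper: nothing here is part of the Sketch Engine itself.
    """
    gramrels = Path(gramrels_path)
    if not gramrels.is_file():
        # Word sketches need a gramrels file, so fail early without one.
        raise FileNotFoundError(f"gramrels file not found: {gramrels}")
    # ASSUMPTION: mkws.sh takes the corpus name as its only argument.
    # Check the script shipped with your installation before relying on this.
    return [script, corpus_name]
```

The returned list can be passed to subprocess.run(..., check=True) once the invocation has been confirmed. Because the thesaurus is built from the word sketch database in the same run, no separate thesaurus step is planned here.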