wiki:SkE/Help/PageSpecificHelp/ConcordanceQuery

Concordance Query Help

Using Query Type, you can refine the type of query you wish to make in the main panel. The options are:

  • Simple: standard query will include capitalisation and inflected variants (see SimpleQuery for details)
  • Lemma: enter the stemmed form e.g. hat, and inflected forms of the same word will be included e.g. hats. You can specify the PoS (grammatical part of speech i.e. noun, verb etc...)
  • Phrase: uses the exact words entered e.g. hats and scarves
  • word form: you can specify the exact word form, the concordance will include capitalisation variants but you can select match case if required, and you can also specify the part of speech (PoS)
  • Character: you can supply a character string that can be matched e.g. ble will match words containing this character sequence. This might be particularly useful in languages where tokenisation is difficult.
  • CQL: uses complex query language see The Corpus Query Language (CQL); clicking at "Tagset summary" opens a web page listing the tags that are used in the current corpus. For example, the British National Corpus uses CLAWS 5 while the British Academic Spoken Corpus uses CLAWS 7.

Expert Options

If Context is selected in the LHS menu, on the main panel you can specify criteria on the context for your query. You can choose to specify the context in terms of surrounding lemma(s) and/or PoS tag(s):

  • Lemma filter: the stemmed (base/root) form of the word that you want to specify as a constraint on the context
  • PoS filter: the part of speech (grammatical class) of words in the context window

For both of these filters you can specify

  • Window: whether this should be right, left or both of the queried term (node) and the size (in tokens) of the window in either direction
  • you can supply more than one lemma or PoS and then stipulate whether:
    • all apply (all)
    • some of them should apply (any)
    • none should apply (none)

Text Types Here you can select a subcorpus or create a new subcorpus from a subset of the current corpus. You can also stipulate constraints on the text types for documents that will be searched for your query

Querying Parallel Corpora

(new in version 2.54-2.89)

If you open a parallel corpus (such as europarl5_de_en), two more query boxes will appear in the concordance query form: Aligned part and NOT in aligned part. Both fields allow you to filter the results of the query, so as the aligned parts match the specified criteria.

For Example, you can query the corpus for occurrences of word "car" that are aligned with "Auto", but not containing the article "das".

See also more advanced options of querying parallel corpora.