wiki:tagsets/cuptreetaggertagset

Penn Treebank Tagset

TreeTagger version as used in Sketch Engine

POS Tag Description Example
CCcoordinating conjunctionand
CDcardinal number1, third
DTdeterminerthe
EXexistential therethere is
FWforeign wordd'hoevre
INpreposition, subordinating conjunctionin, of, like
IN/thatthat as subordinatorthat
JJadjectivegreen
JJRadjective, comparativegreener
JJSadjective, superlativegreenest
LSlist marker1)
MDmodalcould, will
NNnoun, singular or masstable
NNSnoun pluraltables
NPproper noun, singularJohn
NPSproper noun, pluralVikings
PDTpredeterminerboth the boys
POSpossessive endingfriend's
PPpersonal pronounI, he, it
PP$possessive pronounmy, his
RBadverbhowever, usually, naturally, here, good
RBRadverb, comparativebetter
RBSadverb, superlativebest
RPparticlegive up
SENTSentence-break punctuation. ! ?
SYMSymbol/ [ = *
TOinfinitive 'to'to go
UHinterjectionuhhuhhuhh
VBverb, base formtake
VBDverb, past tensetook
VBGverb, gerund/present participletaking
VBNverb, past participletaken
VBPverb, sing. present, non-3dtake
VBZverb, 3rd person sing. presenttakes
WDTwh-determinerwhich
WPwh-pronounwho, what
WP$possessive wh-pronounwhose
WRBwh-adverbwhere, when
###
$$$
"Quotation marks' "
Opening quotation marks' "
(Opening brackets( {
)Closing brackets) }
,Comma,
:Punctuation- ; : -- ...

See also:

  1. Marcus, Beatrice Santorini and M.A. Marcinkiewicz: Building a large annotated corpus of English: The Penn Treebank. In Computational Linguistics, volume 19, number 2, pp313-330.

Main differences to default Penn tagset

In TreeTagger tagset

  • For proper nouns, NNP and NNPS have become NP and NPS
  • SENT for end-of-sentence punctuation (other punctuation tags may also differ)

In TreeTagger+SketchEngine

  • "to" now gets IN when it is a preposition and TO only when it is an infinitive marker

Please also note that the WebBootCat version of the treetagger distinguishes the verb tags for "be" (VB) and "have" (VH) from other (non-modal) verbs (VV)


DM May 2010 for Lexical Computing Ltd