Penn Treebank Tagset
TreeTagger version as used in Sketch Engine
| POS Tag | Description | Example |
| CC | coordinating conjunction | and |
| CD | cardinal number | 1, third |
| DT | determiner | the |
| EX | existential there | there is |
| FW | foreign word | d'oeuvre |
| IN | preposition, subordinating conjunction | in, of, like |
| IN/that | that as subordinator | that |
| JJ | adjective | green |
| JJR | adjective, comparative | greener |
| JJS | adjective, superlative | greenest |
| LS | list marker | 1) |
| MD | modal | could, will |
| NN | noun, singular or mass | table |
| NNS | noun plural | tables |
| NP | proper noun, singular | John |
| NPS | proper noun, plural | Vikings |
| PDT | predeterminer | both the boys |
| POS | possessive ending | friend's |
| PP | personal pronoun | I, he, it |
| PP$ | possessive pronoun | my, his |
| RB | adverb | however, usually, naturally, here, good |
| RBR | adverb, comparative | better |
| RBS | adverb, superlative | best |
| RP | particle | give up |
| SENT | Sentence-break punctuation | . ! ? |
| SYM | Symbol | / [ = * |
| TO | infinitive 'to' | to go |
| UH | interjection | uhhuhhuhh |
| VB | verb, base form | take |
| VBD | verb, past tense | took |
| VBG | verb, gerund/present participle | taking |
| VBN | verb, past participle | taken |
| VBP | verb, non-3rd person singular present | take |
| VBZ | verb, 3rd person sing. present | takes |
| WDT | wh-determiner | which |
| WP | wh-pronoun | who, what |
| WP$ | possessive wh-pronoun | whose |
| WRB | wh-adverb | where, when |
| # | # | # |
| $ | $ | $ |
| " | Quotation marks | ' " |
| `` | Opening quotation marks | ' " |
| ( | Opening brackets | ( { |
| ) | Closing brackets | ) } |
| , | Comma | , |
| : | Punctuation | - ; : -- ... |
See also:
- Mitchell P. Marcus, Beatrice Santorini and Mary Ann Marcinkiewicz (1993): Building a large annotated corpus of English: The Penn Treebank. In Computational Linguistics, volume 19, number 2, pp. 313-330.
Main differences from the default Penn tagset
In TreeTagger tagset
- For proper nouns, NNP and NNPS have become NP and NPS
- SENT for end-of-sentence punctuation (other punctuation tags may also differ)
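
When TreeTagger output has to be compared with resources that use the standard Penn tags, the renamings listed above can be undone with a small lookup. The following is a minimal sketch, not part of TreeTagger or Sketch Engine; the names `TREETAGGER_TO_PENN` and `to_penn` are hypothetical, and only the three differences named above are covered (other punctuation tags may also differ and are passed through unchanged).

```python
# Hypothetical helper: map the TreeTagger tags that differ from the
# default Penn tagset back to their Penn equivalents.
TREETAGGER_TO_PENN = {
    "NP": "NNP",    # proper noun, singular
    "NPS": "NNPS",  # proper noun, plural
    "SENT": ".",    # sentence-break punctuation
}


def to_penn(tag: str) -> str:
    """Return the Penn tag for a TreeTagger tag; unknown tags pass through."""
    return TREETAGGER_TO_PENN.get(tag, tag)


if __name__ == "__main__":
    for tag in ["NP", "NPS", "SENT", "JJ"]:
        print(tag, "->", to_penn(tag))
```

Tags that are identical in both tagsets, such as JJ or NN, are returned as they are.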
In TreeTagger+SketchEngine
- "to" now gets IN when it is a preposition and TO only when it is an infinitive marker
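
Because the default Penn tagset assigns TO to every occurrence of "to", the Sketch Engine distinction can be collapsed again when Penn-style tags are needed. This is a minimal sketch under that assumption; the function name `penn_tag_for_to` is hypothetical and the word form is assumed to be available alongside the tag.

```python
def penn_tag_for_to(word: str, tag: str) -> str:
    """Collapse the Sketch Engine IN/TO split for "to" back to Penn's single TO tag.

    Any other word/tag pair is returned unchanged.
    """
    if word.lower() == "to" and tag == "IN":
        return "TO"
    return tag


if __name__ == "__main__":
    # Prepositional "to" (tagged IN in Sketch Engine) becomes TO again.
    print(penn_tag_for_to("to", "IN"))
    # Other prepositions keep their IN tag.
    print(penn_tag_for_to("in", "IN"))
```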
Please also note that the WebBootCat version of TreeTagger distinguishes the verb tags for "be" (VB) and "have" (VH) from those for other (non-modal) verbs (VV).
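
Corpora tagged with the WebBootCat version can be brought in line with the verb tags in the table above by folding VH and VV back into VB. The sketch below assumes that the VH and VV tags carry the same suffixes as the VB tags in the table (D, G, N, P, Z), for example VVD or VHZ; the function name `collapse_verb_tag` is hypothetical. Note that the mapping discards the information about whether the verb was "be", "have" or another verb.

```python
def collapse_verb_tag(tag: str) -> str:
    """Fold the VH (have) and VV (other verbs) tags into the VB series.

    Assumes the suffix after the two-letter prefix matches the VB series,
    e.g. VVD -> VBD, VHZ -> VBZ. Non-verb tags are returned unchanged.
    """
    if tag.startswith(("VH", "VV")):
        return "VB" + tag[2:]
    return tag


if __name__ == "__main__":
    for tag in ["VV", "VVD", "VHZ", "VBG", "MD", "NN"]:
        print(tag, "->", collapse_verb_tag(tag))
```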
DM May 2010 for Lexical Computing Ltd
