Last modified 2 years ago
The corpus is prepared by Steven Bird. it process is described here
All material is taken from here. It was part-of-speech tagged and lemmatised using TreeTagger?, a leading part-of-speech tagger which has been trained for a number of languages.
Grammatical relation definitions as prepared by David Tugwell for other English corpora were used.
Word sketches are of first version
