Portuguese corpus

The CetemPúblico/CetenFolha Portuguese corpus installed here is part of the resource collected by the Linguateca project, comprising the Público newspaper from Portugal and the Folha newspaper from Brazil.

The corpora were processed with Eckhard Bick's Palavras dependency parser, and the word sketches were generated from the parser output.

With thanks to the publishers of the newspapers, the Linguateca team and Eckhard Bick.