wiki:Website/ResearchProjects

DANTE

This project, funded largely by the Irish Government, has produced a very-high-quality lexical database for English: see  http://webdante.com LCL people: Adam Kilgarriff, Diana McCarthy, Siva Reddy.

PRESEMT

 PRESEMT is an EU, FP7 project in the area of 'hybrid' Machine Translation, combining statistical and rule-based methods. It runs from 2010-2012. LCL people: Adam Kilgarriff, Jan Pomikalek, Avinesh PVS.

KELLY

 KELLY is an EU Lifelong Learning Programme project, preparing wordlists for learners and wordcards for nine languages. It runs from 2009 to 2011. LCL people: Adam Kilgarriff, Ravi Kiran.

HOO (Helping Our Own)

HOO is an evaluation campaign for English grammar checking - in the specialist domain of computational linguistics papers.

CLAEVIPS

A Corpus Linguistic Analysis of Ecosystems Vocabulary in the Public Sphere. Commissioned by the  UK National Ecosystem Assessment and to be presented at Corpus Linguistics 2011, Birmingham. LCL people: Diana McCarthy, Kate Wild.

PICAE

The Pearson International Corpus of Academic English (PICAE) has been jointly developed by Pearson Language Tests and LCL over the period 2008-2009. . It was first presented at IATEFL, Cardiff, 2009. LCL people: Adam Kilgarriff, David Tugwell.

Oxford Children's Corpus

The OCC (Oxford Children's Corpus) is a corpus of writing for children. It has been developed over the period 2006-2011 with the Educational Division of Oxford University Press and will be presented at Corpus Linguistics 2011, Birmingham. LCL people: Adam Kilgarriff, David Tugwell, Kate Wild.

The Corpus Factory

The Corpus Factory is the company's programme for developing corpora of around 100 million words for the world's 100 largest languages. See  here LCL people: Siva Reddy, Girish Duvuru, Jan Pomikalek.

TenTen Corpora

'TenTen' corpora (of order of magnitude 10 to power ten) is the company's programme for developing very large corpora for the world's largest languages (also as part of the PRESEMT project, above). LCL people: Jan Pomikalek.