About CV & Publications
Research Interests:
- The lexicon; its structure; lexical resources, including published dictionaries; how lexicographers write dictionaries, how they determine what meanings a word has, and how this relates to theoretical discussions of ambiguity and computational work on word sense disambiguation. Evaluating word sense disambiguation programs.
- Language corpora; word frequency distributions; how these vary across language varieties, and how they relate to syntactic and lexical hypotheses; corpus interfaces; automatic and semi-automatic lexical acquisition from corpora; using the web as a corpus.
- Lexical semantics; formalisms for lexical representation.
Current and recent work includes:
- using the web as a source of linguistic data
- the structure and potential of word meanings
- co-ordinating SENSEVAL, a Word Sense Disambiguation evaluation exercise Home page here
- measuring similarity between language corpora
- assessments of role of WSD technology for Language Engineering

