TOWARDS COMPUTATIONAL DISCRIMINATION OF ENGLISH WORD SENSES (STATISTICAL, STOCHASTIC, DISCOURSE).

Item

Title: TOWARDS COMPUTATIONAL DISCRIMINATION OF ENGLISH WORD SENSES (STATISTICAL, STOCHASTIC, DISCOURSE).
Identifier: AAI8708276
identifier: 8708276
Creator: BLACK, EZRA WILLIAM.
Contributor: John A. Moyne
Date: 1987
Language: English
Publisher: City University of New York.
Subject: Language, Linguistics
Abstract: An experiment is conducted which compares three different methods of deciding which of three or four senses characterizes each occurrence of a word for which a Key Word In Context concordance has been constructed. The three methods consist of a dictionary-based approach (DG) where categories intended to classify the words and expressions occurring in each concordance line are simply the subject codes of a major dictionary; an approach (DS1) in which categories are obtained via a frequency analysis of words occurring in the immediate neighborhood of the "node word"--the word in focus--of the concordance, and of "content" words occurring anywhere in a given line; and an approach (DS2) chiefly based on the content-analytic categories obtained by closely reading the concordances of a 100-type sampling of words occurring in the 20-25-million-token English text source, consisting of the official proceedings of the Canadian House of Commons. Results are that DG performs extremely poorly--in fact, near-randomly; DS1 and DS2 yield better and substantially similar performances. The conclusion is that domain-general, syntax-based approaches to automatic word sense discrimination and domain-specific, content-analytic approaches need and complement each other.
Type: dissertation
Source: PQT Legacy CUNY.xlsx
degree: Ph.D.
Program: Linguistics

Item sets: CUNY Legacy ETDs

Media: TOWARDS COMPUTATIONAL DISCRIMINATION OF ENGLISH WORD SENSES (STATISTICAL, STOCHASTIC, DISCOURSE).