TOWARDS COMPUTATIONAL DISCRIMINATION OF ENGLISH WORD SENSES (STATISTICAL, STOCHASTIC, DISCOURSE).

Item

Title
TOWARDS COMPUTATIONAL DISCRIMINATION OF ENGLISH WORD SENSES (STATISTICAL, STOCHASTIC, DISCOURSE).
Identifier
AAI8708276
identifier
8708276
Creator
BLACK, EZRA WILLIAM.
Contributor
John A. Moyne
Date
1987
Language
English
Publisher
City University of New York.
Subject
Language, Linguistics
Abstract
An experiment is conducted which compares three different methods of deciding which of three or four senses characterizes each occurrence of a word for which a Key Word In Context concordance has been constructed. The three methods consist of a dictionary-based approach (DG) where categories intended to classify the words and expressions occurring in each concordance line are simply the subject codes of a major dictionary; an approach (DS1) in which categories are obtained via a frequency analysis of words occurring in the immediate neighborhood of the "node word"--the word in focus--of the concordance, and of "content" words occurring anywhere in a given line; and an approach (DS2) chiefly based on the content-analytic categories obtained by closely reading the concordances of a 100-type sampling of words occurring in the 20-25-million-token English text source, consisting of the official proceedings of the Canadian House of Commons. Results are that DG performs extremely poorly--in fact, near-randomly; DS1 and DS2 yield better and substantially similar performances. The conclusion is that domain-general, syntax-based approaches to automatic word sense discrimination and domain-specific, content-analytic approaches need and complement each other.
Type
dissertation
Source
PQT Legacy CUNY.xlsx
degree
Ph.D.
Program
Linguistics
Item sets
CUNY Legacy ETDs