TOWARDS COMPUTATIONAL DISCRIMINATION OF ENGLISH WORD SENSES (STATISTICAL, STOCHASTIC, DISCOURSE).
Item
-
Title
-
TOWARDS COMPUTATIONAL DISCRIMINATION OF ENGLISH WORD SENSES (STATISTICAL, STOCHASTIC, DISCOURSE).
-
Identifier
-
AAI8708276
-
identifier
-
8708276
-
Creator
-
BLACK, EZRA WILLIAM.
-
Contributor
-
John A. Moyne
-
Date
-
1987
-
Language
-
English
-
Publisher
-
City University of New York.
-
Subject
-
Language, Linguistics
-
Abstract
-
An experiment is conducted which compares three different methods of deciding which of three or four senses characterizes each occurrence of a word for which a Key Word In Context concordance has been constructed. The three methods consist of a dictionary-based approach (DG) where categories intended to classify the words and expressions occurring in each concordance line are simply the subject codes of a major dictionary; an approach (DS1) in which categories are obtained via a frequency analysis of words occurring in the immediate neighborhood of the "node word"--the word in focus--of the concordance, and of "content" words occurring anywhere in a given line; and an approach (DS2) chiefly based on the content-analytic categories obtained by closely reading the concordances of a 100-type sampling of words occurring in the 20-25-million-token English text source, consisting of the official proceedings of the Canadian House of Commons. Results are that DG performs extremely poorly--in fact, near-randomly; DS1 and DS2 yield better and substantially similar performances. The conclusion is that domain-general, syntax-based approaches to automatic word sense discrimination and domain-specific, content-analytic approaches need and complement each other.
-
Type
-
dissertation
-
Source
-
PQT Legacy CUNY.xlsx
-
degree
-
Ph.D.
-
Program
-
Linguistics