Phonological constraints and free variation in compounding: A corpus study of English and Estonian noun compounds.

Item

Title
Phonological constraints and free variation in compounding: A corpus study of English and Estonian noun compounds.
Identifier
AAI3213181
identifier
3213181
Creator
Sepp, Mary.
Contributor
Adviser: Martin Chodorow
Date
2006
Language
English
Publisher
City University of New York.
Subject
Language, Linguistics | Language, Modern
Abstract
This research was designed to examine the patterns of variation in the phonological and/or orthographic form of Estonian and English noun compounds. Estonian noun compounds generally occur in one of two forms: N1(nominative) + N2, as in kool + meister ("schoolmaster"), or N1(genitive) + N2, as in kooli + opetaja ("schoolteacher"). Some Estonian compounds vary freely in form---e.g., veebsepp/veebisepp ("webmaster"). English noun compounds exhibit orthographic variation, as they may be written in three ways: closed ("bookstore"), hyphenated ("dot-com"), or open ("space station"). Many English compounds also vary freely---e.g., cellphone/cell-phone/cell phone. The principal goal of this study was to use statistical data derived from corpora to determine which variables best account for the choice of variant compound forms.;The 1,094 Estonian compounds used in this research came from a one million word corpus of Estonian literary and news texts. Data on variation of form were obtained from Google searches of the World Wide Web. Results showed a strong preference for genitive forms, and it was posited that this preference is due to general principles of ease of pronunciation and ease of perception.;Phonology is also a factor in the distribution of English compounds. A number of phonological variables were examined in the current study: number of syllables, presence of compound stress, vowel sequences across internal lexical boundaries, and double consonants across internal lexical boundaries. Frequency data for these variables were extracted from a fourteen million word English corpus. Results of multiple regression analyses showed that the number of syllables in the compound is a stronger predictor of orthographic form than the other phonological features that were tested. Phonology was not assumed to be the only influence, however; lexical features were also examined. Results indicated a substantial contribution of the second constituent in predicting whether the compound would be open or closed, and a lesser, though important, contribution of the first constituent. A regression analysis combining phonological and lexical variables accounted for about 68% of the variance in the orthography of 707 high frequency English noun compounds.
Type
dissertation
Source
PQT Legacy CUNY.xlsx
degree
Ph.D.
Item sets
CUNY Legacy ETDs