The Gibbs sampler applied to missing data with categorical, continuous and mixed data types.

Item

Title
The Gibbs sampler applied to missing data with categorical, continuous and mixed data types.
Identifier
AAI9630440
identifier
9630440
Creator
Boslaugh, Sarah E.
Contributor
Adviser: Alan Gross
Date
1996
Language
English
Publisher
City University of New York.
Subject
Education, Educational Psychology | Statistics
Abstract
In the present research we have investigated the problem of estimating multiple correlations for regression models containing a mix of categorical, continuous and interaction terms given a data set containing missing values. We consider the case where the model has a single binary predictor variable, a single continuous predictor variable, and a cross-product term. A Bayesian approach is used to obtain an interval estimate of the multiple correlation of Y on the predictors ({dollar}\rho\sp2{dollar}) using a Gibbs sampling procedure. Using 5,000 samples from the posterior distribution of {dollar}\rho\sp2{dollar}, we empirically construct.90 highest-density regions (HDR's) for {dollar}\rho\sp2{dollar}. To demonstrate the estimation procedure, 32 data samples were used; 18 with data missing completely at random (MCAR) and 18 with data missing at random (MAR). Within each set of 18, three sample sizes (30, 50 or 100), three population values for {dollar}\rho\sp2{dollar} (.10,.25 and.50) and two probabilities of missing data (.271 and.657) were used. In the MCAR case, 17 of the 18 HDR's contained the population {dollar}\rho\sp2{dollar}, while in the MAR cases, 16 of the 18 HDR's contained the population {dollar}\rho\sp2{dollar}. As expected, smaller sample sizes and more missing data produced wider HDR's, and MAR data produced slightly wider HDR's than MCAR data.
Type
dissertation
Source
PQT Legacy CUNY.xlsx
degree
Ph.D.
Item sets
CUNY Legacy ETDs