Regression Methods for Categorical Dependent Variables: Effects on a Model of Student College Choice

dc.contributor.advisor: Plucker, Jonathan A.
dc.contributor.advisor: Delandshere, Ginette
dc.contributor.author: Rapp, Kelly E.
dc.date.accessioned: 2013-05-15T23:54:07Z
dc.date.available: 2013-05-15T23:54:07Z
dc.date.issued: 2013-05-15
dc.date.submitted: 2012
dc.description: Thesis (Ph.D.) - Indiana University, School of Education, 2012
dc.description.abstract: The use of categorical dependent variables with the classical linear regression model (CLRM) violates many of the model's assumptions and may result in biased estimates (Long, 1997; O'Connell, Goldstein, Rogers, & Peng, 2008). Many dependent variables of interest to educational researchers (e.g., professorial rank, educational attainment) are categorical in nature but are analyzed using the CLRM (Harwell & Gatti, 2001), even though alternative regression techniques for categorical dependent variables are recommended (Agresti, 1996; Long, 1997). Data obtained from ACT®, Inc., on 5,200 high school seniors in Illinois and Colorado were used to analyze the effects of regression method on a model of ascriptive and academic influences on the selectivity of the postsecondary institution attended. The dependent variable was measured in rank-ordered categories based on self-reported institutional admissions policies and analyzed with classical linear, multinomial logistic, and ordered logistic regressions. Choice of regression method did not affect overall model performance, as evidenced by significant F and likelihood ratio χ² tests. The full CLRM fit the data moderately well (R² = .391), surpassing some previous findings (Hearn, 1988, 1991; Davies & Guppy, 1997). McFadden's R²L measure of strength of association was larger in the multinomial regression than in the ordered regression (R²L = .191 vs. R²L = .158). The multinomial logistic method also predicted the dependent variable category with the greatest accuracy (46.3% correct), but Somers' Dyx measure of association was smallest for the multinomial model. The direction and significance of the relationships between the predictors and the dependent variable were substantively consistent across the CLRM and logistic methods. In all regressions, ACT® score had the greatest impact on the selectivity of the institution attended. Threshold values were significant, supporting the assumption of an ordered dependent variable.
Due to the CLRM's theoretical and predictive shortcomings and the multinomial model's complexity of interpretation, ordered logistic regression was determined to be the most appropriate method for explaining influences on the selectivity of the postsecondary institution attended.
dc.identifier.uri: https://hdl.handle.net/2022/15879
dc.language.iso: en
dc.publisher: [Bloomington, Ind.] : Indiana University
dc.rights: This work may be protected by copyright unless otherwise stated.
dc.subject: college selectivity
dc.subject: comparative study
dc.subject: logistic regression
dc.subject: ordinal data
dc.subject: student college choice
dc.subject.classification: Educational psychology
dc.subject.classification: Statistics
dc.subject.classification: Higher education
dc.title: Regression Methods for Categorical Dependent Variables: Effects on a Model of Student College Choice
dc.type: Doctoral Dissertation

Files

Original bundle

Name: Rapp_indiana_0093A_11545.pdf
Size: 2.35 MB
Format: Adobe Portable Document Format