HSB                 package:candisc                 R Documentation

_H_i_g_h _S_c_h_o_o_l _a_n_d _B_e_y_o_n_d _D_a_t_a

_D_e_s_c_r_i_p_t_i_o_n:

     The High School and Beyond Project was a longitudinal study of
     students in the U.S. carried out in 1980 by the National Center
     for Education Statistics.   Data were collected from 58,270 high
     school students (28,240 seniors and 30,030 sophomores) and 1,015
     secondary schools.  The HSB data frame is sample of 600
     observations, of unknown characteristics, originally taken from
     Tatsuoka (1988).

_U_s_a_g_e:

     data(HSB)

_F_o_r_m_a_t:

     A data frame with 600 observations on the following 15 variables. 
     There is no missing data.

     '_i_d' Observation id: a numeric vector

     '_g_e_n_d_e_r' a factor with levels 'male' 'female'

     '_r_a_c_e' Race or ethnicity: a factor with levels 'hispanic' 'asian'
          'african-amer' 'white'

     '_s_e_s' Socioeconomic status: a factor with levels 'low' 'middle'
          'high'

     '_s_c_h' School type: a factor with levels 'public' 'private'

     '_p_r_o_g' High school program: a factor with levels 'general'
          'academic' 'vocation'

     '_l_o_c_u_s' Locus of control: a numeric vector

     '_c_o_n_c_e_p_t' Self-concept: a numeric vector

     '_m_o_t' Motivation: a numeric vector

     '_c_a_r_e_e_r' Career plan: a factor with levels 'clerical' 'craftsman' 
          'farmer' 'homemaker' 'laborer' 'manager' 'military' 
          'operative' 'prof1' 'prof2' 'proprietor' 'protective' 
          'sales' 'school' 'service' 'technical' 'not working'

     '_r_e_a_d' Standardized reading score: a numeric vector

     '_w_r_i_t_e' Standardized writing score: a numeric vector

     '_m_a_t_h' Standardized math score: a numeric vector

     '_s_c_i' Standardized science score: a numeric vector

     '_s_s' Standardized social science (civics) score: a numeric vector

_S_o_u_r_c_e:

     Tatsuoka, M. M. (1988).  Multivariate Analysis: Techniques for
     Educational and Psychological Research (2nd ed.). New York:
     Macmillan, Appendix F, 430-442.

     Retrieved from: <URL:
     http://www.gseis.ucla.edu/courses/data/hbs6.dta>

_R_e_f_e_r_e_n_c_e_s:

     High School and Beyond data files: <URL:
     http://www.sscnet.ucla.edu/issr/da/index/techinfo/I78961.HTM>

_E_x_a_m_p_l_e_s:

     str(HSB)
     # main effects model
     hsb.mod <- lm( cbind(read, write, math, sci, ss) ~
                     gender + race + ses + sch + prog, data=HSB)
     Anova(hsb.mod)

     # Add some interactions
     hsb.mod1 <- update(hsb.mod, . ~ . + gender:race + ses:prog)
     heplot(hsb.mod1, col=palette()[c(2,1,3:6)], variables=c("read","math"))

     hsb.can1 <- candisc(hsb.mod1, term="race")
     heplot(hsb.can1, col=c("red", "black"))

     # show canonical results for all terms
     hsb.can <- candiscList(hsb.mod)
     hsb.can

