censusn                package:dprep                R Documentation

_T_h_e _c_e_n_s_u_s _d_a_t_a_s_e_t

_D_e_s_c_r_i_p_t_i_o_n:

     This is the census dataset from the UCI where the values of the
     nominal attributes  are numerically codified. This dataset
     contains plenty of missing values.

_U_s_a_g_e:

     data(censusn)

_F_o_r_m_a_t:

     A data frame with 32561 observations on the following 14
     variables.

     _V_1 age:continuous

     _V_2 workclass:

     _V_3 fnlwgt:continuous

     _V_4 education

     _V_5 marital-status:

     _V_6 occupation:

     _V_7 relationship:

     _V_8 race

     _V_9 sex

     _V_1_0 capital-gain: continuous.

     _V_1_1 capital-loss: continuous.

     _V_1_2 hours-per-week: continuous.

     _V_1_3 native-country:

     _V_1_4 class: >50K, <=50K 

_D_e_t_a_i_l_s:

     The fifth and fourth features of the orginal dataset were the
     same, since the  fifth contained the numerical codifications of
     the fourth. In  censusn only one of these feature is considered.
     The values of the nominal attributes are as follows:  workclass:
     Private, Self-emp-not-inc, Self-emp-inc, Federal-gov, Local-gov,
     State-gov, Without-pay, Never-worked. education: Bachelors,
     Some-college, 11th, HS-grad, Prof-school, Assoc-acdm, Assoc-voc,
     9th, 7th-8th, 12th, Masters,  1st-4th, 10th, Doctorate, 5th-6th,
     Preschool. marital-status: Married-civ-spouse, Divorced,
     Never-married, Separated, Widowed, Married-spouse-absent,
     Married-AF-spouse. occupation: Tech-support, Craft-repair,
     Other-service, Sales, Exec-managerial, Prof-specialty,
     Handlers-cleaners, Machine-op-inspct,  Adm-clerical,
     Farming-fishing, Transport-moving, Priv-house-serv,
     Protective-serv, Armed-Forces. relationship: Wife, Own-child,
     Husband, Not-in-family, Other-relative, Unmarried. race: White,
     Asian-Pac-Islander, Amer-Indian-Eskimo, Other, Black. sex: Female,
     Male. native-country: United-States, Cambodia, England,
     Puerto-Rico, Canada, Germany, Outlying-US(Guam-USVI-etc), India,
     Japan,  Greece, South, China, Cuba, Iran, Honduras, Philippines,
     Italy, Poland, Jamaica, Vietnam, Mexico, Portugal, Ireland,
     France, Dominican-Republic, Laos, Ecuador, Taiwan, Haiti,
     Columbia, Hungary, Guatemala, Nicaragua, Scotland, Thailand,
     Yugoslavia,  El-Salvador, Trinadad&Tobago, Peru, Hong,
     Holand-Netherlands.

_S_o_u_r_c_e:

     The UCI Machine Learning Database Repository at:

        *  <URL: ftp://ftp.ics.uci.edu/pub/machine-learning-databases>

        *  <URL: http://www.ics.uci.edu/~mlearn/MLRepository.html>

_E_x_a_m_p_l_e_s:

     data(censusn)
     #----knn imputation------
     data(censusn)
     imagmiss(censusn, "censusn")

