valley                package:Biodem                R Documentation

_R_a_w _m_a_r_r_i_a_g_e _d_a_t_a

_D_e_s_c_r_i_p_t_i_o_n:

     A raw marriage data set, in which every row corresponds to a
     different marriage record.

_U_s_a_g_e:

     data(valley)

_F_o_r_m_a_t:

     A 702 rows by 8 columns dataset. The columns of "valley" contain
     the following information:

       PAR:   (sub)population to which the marriage is referred.
       YEAR:  year in which the marriage was performed.
       SURM:  male surname.
       NM:    male birth (sub)population.
       RM:    male residence (sub)population.
       SURF:  female surname.
       NF:    female birth (sub)population.
       RF:    female residence (sub)population.

     For all columns the letter "X" indicates that the mate was born or
     resident outside of the study area.

_D_e_t_a_i_l_s:

     Marriage data, depending from the used sources, may contain more
     or less information than the "valley" example data set. Columns
     order in the dataset is not relevant. Information on
     (sub)population, male and female surnames are needed to perform
     surname-based analyses on marriage data. NB. Information on mates
     birthplace (or equivalent data) can be used to produce a migration
     matrix (see the "raw.mig" dataset). NB2. Given that surnames may
     contain spaces (e.g. "DE IORIO"), the best way to import surname
     data is to save the original data base as a .csv file, and then
     use the read.csv() or read.csv2() functions. Another option is to
     use GNUMERIC, because it has a text export feature that allows to
     put  brakets ("") to the left and right of every cell content, so
     that composite surnames are read as a string. The resulting text
     file is easily imported by read.table().

_S_o_u_r_c_e:

     Paola Gueresi. Subset of a real marriage dataset

_E_x_a_m_p_l_e_s:

     data(valley)

