Colon              package:plsgenomics              R Documentation

_G_e_n_e _e_x_p_r_e_s_s_i_o_n _d_a_t_a _f_r_o_m _A_l_o_n _e_t _a_l. (_1_9_9_9)

_D_e_s_c_r_i_p_t_i_o_n:

     Gene expression data (2000 genes for 62 samples) from the
     microarray experiments of Colon tissue samples of Alon et al.
     (1999).

_U_s_a_g_e:

     data(Colon)

_D_e_t_a_i_l_s:

     This data set contains 62 samples  with 2000 genes: 40 tumor
     tissues, coded 2 and 22 normal tissues, coded 1.

_V_a_l_u_e:

     A list with the following elements: 

       X: a (62 x 2000) matrix giving the expression levels of 2000 
          genes for the 62 Colon tissue samples. Each row corresponds
          to a patient, each column to a gene.

       Y: a numeric vector of length 62 giving the type of tissue
          sample (tumor or normal).

gene.names: a vector containing the names of the 2000 genes for the
          gene expression matrix 'X'.

_S_o_u_r_c_e:

     The data are described in Alon et al. (1999) and can be freely
     downloaded from  <URL:
     http://microarray.princeton.edu/oncology/affydata/index.html>.

_R_e_f_e_r_e_n_c_e_s:

     Alon, U. and Barkai, N. and Notterman, D.A. and Gish, K. and
     Ybarra, S. and Mack, D. and Levine, A.J. (1999). Broad patterns of
     gene expression revealed by clustering analysis of tumor and
     normal colon tissues probed by oligonucleotide arrays, Proc. Natl.
     Acad. Sci. USA,*96*(12), 6745-6750.

_E_x_a_m_p_l_e_s:

     # load plsgenomics library
     library(plsgenomics)

     # load data set
     data(Colon)

     # how many samples and how many genes ?
     dim(Colon$X)

     # how many samples of class 1 and 2 respectively ?
     sum(Colon$Y==1)
     sum(Colon$Y==2)

