prostate {spls} | R Documentation |
This is the Prostate Tumor Gene Expression dataset used in Chung and Keles (2009).
data(prostate)
A list with two components:
The prostate dataset consists of 52 prostate tumor and 50 normal samples.
Normal and tumor classes are coded in 0 and 1, respectively, in y
vector.
Matrix x
is gene expression data and
arrays were normalized, log transformed, and standardized
to zero mean and unit variance across genes as described
in Dettling (2004) and Dettling and Beuhlmann (2002).
See Chung and Keles (2009) for more details.
Singh, D., Febbo, P., Ross, K., Jackson, D., Manola, J., Ladd, C., Tamayo, P., Renshaw, A., DAmico, A., Richie, J., Lander, E., Loda, M., Kantoff, P., Golub, T., and Sellers, W. (2002). "Gene expression correlates of clinical prostate cancer behavior", Cancer Cell, 1, pp. 203–209.
Chung, D. and Keles, S. (2009). "Sparse partial least squares classification for high dimensional data" (http://www.stat.wisc.edu/~keles/Papers/C_SPLS.pdf).
Dettling, M. (2004). "BagBoosting for tumor classification with gene expression data", Bioinformatics, 20, pp. 3583–3593.
Dettling, M. and Beuhlmann, P. (2002). "Supervised clustering of genes", Genome Biology, 3, pp. research0069.1–0069.15.
data(prostate) prostate$x[1:5,1:5] prostate$y