This Picostat.com statistics page describes the cf data set, which contains cystic fibrosis data. The cf data set is part of the gap R package. To load it, attach the package with library(gap) and then run data("cf") at the R console, or run data("cf", package = "gap") directly. Either way, the data are loaded into a variable called cf. If R reports that the package or the cf data set cannot be found, install the package with install.packages("gap") and then reload the data. If you need to download R, you can go to the R project website. A CSV (comma-separated values) version of the cf data set is also available for download; the file is about 9,107 bytes.
Cystic fibrosis data
This data set contains a case-control indicator and 23 SNPs.
The 22 inter-marker distances between the 23 SNPs (in Morgans) are as follows:
0.000090, 0.000158, 0.005000, 0.000100, 0.000200, 0.000150, 0.000250,
0.000200, 0.000050, 0.000350, 0.000300, 0.000250, 0.000350, 0.000350, 0.000800,
0.000100, 0.000200, 0.000150, 0.000550, 0.006000, 0.000700, 0.001000
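Inter-marker distances are often easier to work with as cumulative map positions. The following is a minimal sketch in base R, assuming the first SNP is placed at position 0; the variable names and the "loc" labels are illustrative and are not part of the gap package.

```r
# Inter-marker distances in Morgans, copied from the list above
# (22 intervals between 23 SNPs).
dist_morgan <- c(
  0.000090, 0.000158, 0.005000, 0.000100, 0.000200, 0.000150, 0.000250,
  0.000200, 0.000050, 0.000350, 0.000300, 0.000250, 0.000350, 0.000350, 0.000800,
  0.000100, 0.000200, 0.000150, 0.000550, 0.006000, 0.000700, 0.001000
)

# Position of each SNP relative to the first one (assumed to sit at 0), in Morgans.
pos_morgan <- c(0, cumsum(dist_morgan))

# The same positions in centiMorgans (1 Morgan = 100 cM).
pos_cm <- 100 * pos_morgan

# One row per SNP; the "loc" labels are hypothetical names for illustration.
snp_map <- data.frame(
  snp    = paste0("loc", seq_along(pos_morgan)),
  pos_cm = pos_cm
)
print(snp_map)
```

The last row gives the total span of the 23 markers, which is simply the sum of the listed distances.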
The data are stored as a data frame containing 186 rows and 24 columns (the case-control indicator plus the 23 SNPs).
This data set can be used as an example of converting PL-EM format to matrix format:
cfdata <- vector("numeric")
cfname <- vector("character")
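# loop over the SNP columns (column 1 is the case-control indicator)
for (i in 2:dim(cf)[2])
{
  # split the PL-EM genotype code in column i into its two alleles
  tmp <- plem2m(cf[,i])
  a1 <- tmp[[1]]
  a2 <- tmp[[2]]
  cfdata <- cbind(cfdata,a1,a2)
  # name the allele columns loc1.a1, loc1.a2, loc2.a1, ...
  a1name <- paste("loc",i-1,".a1",sep="")
  a2name <- paste("loc",i-1,".a2",sep="")
  cfname <- cbind(cfname,a1name,a2name)
}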
cfdata <- as.data.frame(cfdata)
names(cfdata) <- cfname
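The allele column names assembled inside the loop can also be generated in one vectorized step. This is only a sketch of the naming scheme in base R; it assumes 23 SNP columns, as stated above, and the names n_snps and allele_names are illustrative.

```r
# Number of SNP columns: 24 columns minus the case-control indicator.
n_snps <- 23

# rbind() stacks the ".a1" labels over the ".a2" labels in a 2 x 23 matrix;
# as.vector() then reads it column by column, which interleaves the labels
# as loc1.a1, loc1.a2, loc2.a1, loc2.a2, ...
allele_names <- as.vector(rbind(
  paste0("loc", seq_len(n_snps), ".a1"),
  paste0("loc", seq_len(n_snps), ".a2")
))

print(head(allele_names, 4))
print(length(allele_names))
```

The resulting vector has two names per SNP, in the same order in which the loop appends the allele columns to cfdata.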
Liu JS, Sabatti C, Teng J, Keats BJB, Risch N (2001). Bayesian Analysis of Haplotypes for Linkage Disequilibrium Mapping. Genome Research 11:1716-1724.
Dataset imported from https://www.r-project.org.