R Dataset / Package HistData / ZeaMays
Attachment  Size 

dataset75564.csv  350 bytes 
Documentation 

On this Picostat.com statistics page, you will find information about the ZeaMays data set which pertains to Darwin's Heights of Cross and Selffertilized Zea May Pairs. The ZeaMays data set is found in the HistData R package. You can load the ZeaMays data set in R by issuing the following command at the console data("ZeaMays"). This will load the data into a variable called ZeaMays. If R says the ZeaMays data set is not found, you can try installing the package by issuing this command install.packages("HistData") and then attempt to reload the data. If you need to download R, you can go to the R project website. You can download a CSV (comma separated values) version of the ZeaMays R data set. The size of this file is about 350 bytes. Darwin's Heights of Cross and Selffertilized Zea May PairsDescriptionDarwin (1876) studied the growth of pairs of zea may (aka corn) seedlings, one produced by crossfertilization and the other produced by selffertilization, but otherwise grown under identical conditions. His goal was to demonstrate the greater vigour of the crossfertilized plants. The data recorded are the final height (inches, to the nearest 1/8th) of the plants in each pair. In the Design of Experiments, Fisher (1935) used these data to illustrate
a paired ttest (well, a onesample test on the mean difference, Usagedata(ZeaMays) FormatA data frame with 15 observations on the following 4 variables.
DetailsIn addition to the standard paired ttest, several types of nonparametric tests can be contemplated: (a) Permutation test, where the values of, say (b) Permutation test based on assigning each (c) Wilcoxon signed rank test: tests the hypothesis that the median signed rank of the SourceDarwin, C. (1876). The Effect of Cross and Selffertilization in the Vegetable Kingdom, 2nd Ed. London: John Murray. Andrews, D. and Herzberg, A. (1985) Data:
a collection of problems from many fields for the student and research worker.
New York: Springer. Data retrieved from: ReferencesFisher, R. A. (1935). The Design of Experiments. London: Oliver & Boyd. See Also
Examplesdata(ZeaMays)################################## ## Some preliminary exploration ## ################################## boxplot(ZeaMays[,c("cross", "self")], ylab="Height (in)", xlab="Fertilization")# examine large individual diff/ces largediff < subset(ZeaMays, abs(diff) > 2*sd(abs(diff))) with(largediff, segments(1, cross, 2, self, col="red"))# plot cross vs. self. NB: unusual trend and some unusual points with(ZeaMays, plot(self, cross, pch=16, cex=1.5)) abline(lm(cross ~ self, data=ZeaMays), col="red", lwd=2)# pot effects ? anova(lm(diff ~ pot, data=ZeaMays))############################## ## Tests of mean difference ## ############################## # Wilcoxon signed rank test # signed ranks: with(ZeaMays, sign(diff) * rank(abs(diff))) wilcox.test(ZeaMays$cross, ZeaMays$self, conf.int=TRUE, exact=FALSE)# ttests with(ZeaMays, t.test(cross, self)) with(ZeaMays, t.test(diff))mean(ZeaMays$diff) # complete permutation distribution of diff, for all 2^15 ways of assigning # one value to cross and the other to self (thx: Bert Gunter) N < nrow(ZeaMays) allmeans < as.matrix(expand.grid(as.data.frame( matrix(rep(c(1,1),N), nr =2)))) %*% abs(ZeaMays$diff) / N# uppertail pvalue sum(allmeans > mean(ZeaMays$diff)) / 2^N # twotailed pvalue sum(abs(allmeans) > mean(ZeaMays$diff)) / 2^Nhist(allmeans, breaks=64, xlab="Mean difference, crossself", main="Histogram of all mean differences") abline(v=c(1, 1)*mean(ZeaMays$diff), col="red", lwd=2, lty=1:2)plot(density(allmeans), xlab="Mean difference, crossself", main="Density plot of all mean differences") abline(v=c(1, 1)*mean(ZeaMays$diff), col="red", lwd=2, lty=1:2)  Dataset imported from https://www.rproject.org. 
Picostat Manual 

How To Register With a Username
How To Register With Google Single Sign On (SSO)
How To Login With a Username and Password
How To Login With Google Single Sign On (SSO)
How To Import a Dataset
How To Perform Statistical Analysis with Picostat

Recent Queries For This Dataset 

No queries made on this dataset yet. 
Title  Authored on  Content type 

R Dataset / Package MASS / muscle  March 9, 2018  1:06 PM  Dataset 
R Dataset / Package MASS / DDT  March 9, 2018  1:06 PM  Dataset 
OpenIntro Statistics Dataset  pm25_2011_durham  August 9, 2020  2:30 PM  Dataset 
R Dataset / Package wooldridge / jtrain2  March 9, 2018  1:06 PM  Dataset 
R Dataset / Package DAAG / nsw74psid3  March 9, 2018  1:06 PM  Dataset 