# R Dataset / Package HistData / Yeast

Attachment | Size |
---|---|

dataset-14587.csv | 344 bytes |

Dataset Help |
---|

On this Picostat.com statistics page, you will find information about the Yeast data set which pertains to Student's (1906) Yeast Cell Counts. The Yeast data set is found in the HistData R package. You can load the Yeast data set in R by issuing the following command at the console data("Yeast"). This will load the data into a variable called Yeast. If R says the Yeast data set is not found, you can try installing the package by issuing this command install.packages("HistData") and then attempt to reload the data. If you need to download R, you can go to the R project website. You can download a CSV (comma separated values) version of the Yeast R data set. The size of this file is about 344 bytes. |

Documentation |
---|

## Student's (1906) Yeast Cell Counts## DescriptionCounts of the number of yeast cells were made each of 400 regions in a 20 x 20 grid on a microscope slide, comprising a 1 sq. mm. area. This experiment was repeated four times, giving samples A, B, C and D. Student (1906) used these data to investigate the errors in random sampling. He says "there are two sources of error: (a) the drop taken may not be representative of the bulk of the liquid; (b) the distribution of the cells over the area which is examined is never exactly uniform, so that there is an 'error of random sampling.'" The data in the paper are provided in the form of discrete frequency distributions
for the four samples. Each shows the frequency distribution squares containing
a ## Usagedata(Yeast) data(YeastD.mat) ## Format
`sample` Sample identifier, a factor with levels `A` `B` `C` `D` `count` The number of yeast cells counted in a square `freq` The number of squares with the given `count`
## DetailsStudent considers the distribution of a total of ## SourceD. J. Hand, F. Daly, D. Lunn, K. McConway and E. Ostrowski (1994).
## References"Student" (1906) On the error of counting with a haemocytometer. Biometrika, 5, 351-360. http://www.medicine.mcgill.ca/epidemiology/hanley/c626/Student_counting.pdf ## Examplesdata(Yeast)require(lattice) # basic bar charts # TODO: frequencies should start at 0, not 1. barchart(count~freq|sample, data=Yeast, ylab="Number of Cells", xlab="Frequency") barchart(freq~count|sample, data=Yeast, xlab="Number of Cells", ylab="Frequency", horizontal=FALSE, origin=0)# same, using xyplot xyplot(freq~count|sample, data=Yeast, xlab="Number of Cells", ylab="Frequency", horizontal=FALSE, origin=0, type="h", lwd=10) -- Dataset imported from https://www.r-project.org. |

Curated Data | File Size |
---|---|

OpenIntro Statistics Dataset - yrbss_samp | 7.62 KB |

OpenIntro Statistics Dataset - yrbss | 1.03 MB |

OpenIntro Statistics Dataset - yawn | 861 bytes |

OpenIntro Statistics Dataset - xom | 4.95 KB |

OpenIntro Statistics Dataset - winery_cars | 489 bytes |

#### Pagination

All Public Datasets | File Size |
---|---|

PSID | |

wage1 | 328 bytes |

OpenIntro Statistics Dataset - yrbss_samp | 7.62 KB |

OpenIntro Statistics Dataset - yrbss | 1.03 MB |

OpenIntro Statistics Dataset - yawn | 861 bytes |

#### Pagination

Recent Queries For This Dataset |
---|

No queries made on this dataset yet. |

Recent Queries | App/Dataset | By | Date |
---|---|---|---|

Picostat Output - Cumulative Frequency Histogram | R Dataset / Package datasets / warpbreaks | Jaliste | October 31, 2020 - 9:22 PM |

Picostat Output - Boxplot | R Dataset / Package datasets / warpbreaks | Jaliste | October 31, 2020 - 9:21 PM |

Picostat Output - Boxplot | R Dataset / Package datasets / warpbreaks | Jaliste | October 31, 2020 - 9:21 PM |

Picostat Output - Simple Linear Regression | R Dataset / Package wooldridge / crime2 | ikramnajah | October 31, 2020 - 9:49 AM |

Picostat Output - Simple Linear Regression | R Dataset / Package datasets / esoph | hyi2 | October 29, 2020 - 5:44 AM |