On this Picostat.com statistics page, you will find information about the bfi data set which pertains to 25 Personality items representing 5 factors. The bfi data set is found in the psych R package. Try to load the bfi data set in R by issuing the following command at the console data("bfi"). This may load the data into a variable called bfi. If R says the bfi data set is not found, you can try installing the package by issuing this command install.packages("psych") and then attempt to reload the data with library("psych") followed by data("bfi"). Perhaps strangley, if R gives you no output after entering a command, it means the command succeeded. If it succeeded you can see the data by typing bfi at the command-line which should display the entire dataset.
If you need to download R, you can go to the R project website. You can download a CSV (comma separated values) version of the bfi R data set. The size of this file is about 160,481 bytes.
25 Personality items representing 5 factors
25 personality self report items taken from the International Personality Item Pool (ipip.ori.org) were included as part of the Synthetic Aperture Personality Assessment (SAPA) web based personality assessment project. The data from 2800 subjects are included here as a demonstration set for scale construction, factor analysis, and Item Response Theory analysis. Three additional demographic variables (sex, education, and age) are also included.
A data frame with 2800 observations on the following 28 variables. (The q numbers are the SAPA item numbers).
Am indifferent to the feelings of others. (q_146)
Inquire about others' well-being. (q_1162)
Know how to comfort others. (q_1206)
Love children. (q_1364)
Make people feel at ease. (q_1419)
Am exacting in my work. (q_124)
Continue until everything is perfect. (q_530)
Do things according to a plan. (q_619)
Do things in a half-way manner. (q_626)
Waste my time. (q_1949)
Don't talk a lot. (q_712)
Find it difficult to approach others. (q_901)
Know how to captivate people. (q_1205)
Make friends easily. (q_1410)
Take charge. (q_1768)
Get angry easily. (q_952)
Get irritated easily. (q_974)
Have frequent mood swings. (q_1099
Often feel blue. (q_1479)
Panic easily. (q_1505)
Am full of ideas. (q_128)
Avoid difficult reading material.(q_316)
Carry the conversation to a higher level. (q_492)
Spend time reflecting on things. (q_1738)
Will not probe deeply into a subject. (q_1964)
Males = 1, Females =2
1 = HS, 2 = finished HS, 3 = some college, 4 = college graduate 5 = graduate degree
age in years
The first 25 items are organized by five putative factors: Agreeableness, Conscientiousness, Extraversion, Neuroticism, and Opennness. The scoring key is created using
make.keys, the scores are found using
These five factors are a useful example of using
irt.fa to do Item Response Theory based latent factor analysis of the
polychoric correlation matrix. The endorsement plots for each item, as well as the item information functions reveal that the items differ in their quality.
The item data were collected using a 6 point response scale:
1 Very Inaccurate
2 Moderately Inaccurate
3 Slightly Inaccurate
4 Slightly Accurate
5 Moderately Accurate
6 Very Accurate
as part of the Synthetic Apeture Personality Assessment (SAPA http://sapa-project.org) project. To see an example of the data collection technique, visit http://SAPA-project.org. The items given were sampled from the International Personality Item Pool of Lewis Goldberg using the sampling technique of SAPA. This is a sample data set taken from the much larger SAPA data bank.
The bfi data set and items should not be confused with the BFI (Big Five Inventory) of Oliver John and colleagues (John, O. P., Donahue, E. M., & Kentle, R. L. (1991). The Big Five Inventory–Versions 4a and 54. Berkeley, CA: University of California,Berkeley, Institute of Personality and Social Research.)
The items are from the ipip (Goldberg, 1999). The data are from the SAPA project (Revelle, Wilt and Rosenthal, 2010) , collected Spring, 2010 ( http://sapa-project.org).
Goldberg, L.R. (1999) A broad-bandwidth, public domain, personality inventory measuring the lower-level facets of several five-factor models. In Mervielde, I. and Deary, I. and De Fruyt, F. and Ostendorf, F. (eds) Personality psychology in Europe. 7. Tilburg University Press. Tilburg, The Netherlands.
Revelle, W., Wilt, J., and Rosenthal, A. (2010) Individual Differences in Cognition: New Methods for examining the Personality-Cognition Link In Gruszka, A. and Matthews, G. and Szymura, B. (Eds.) Handbook of Individual Differences in Cognition: Attention, Memory and Executive Control, Springer.
bi.bars to show the data by age and gender,
irt.fa for item factor analysis applying the irt model.
openness = c("O1","-O2","O3","O4","-O5"))
scores <- scoreItems(keys.list,bfi,min=1,max=6) #specify the minimum and maximum values
#show the use of the fa.lookup with a dictionary
Dataset imported from https://www.r-project.org.