Hi, a number of allstaters have asked me for a copy of the replies I received to my question on Factor Analysis - please find this below. Many thanks to all those who took the time to reply. As you can see below there was a range of opinions on the approach to analysing likert data. Best Wishes to you all Graham Original Question: I have undertaken some factor analysis on a set of data derived from Likert type (5 point ordinal) questions. Other papers on the topic in question have used FA however a reviewer has pointed out FA requires the data to be at least interval in nature. What do members of the list feel - is it ever acceptable to use FA on likert scale variables (if so when, is there any diagnostic tests?) any references to acceptability of use would be most appreciated. If not acceptable what would be the prefered way of undertaking a FA type analysis on ordinal data? I've heard of 'optimal scaling' though I've never used it. -------------------------------------------------- Responses – truncated in places for brevity: I'd agree with your reviewer, though to an extent it depends on how well-behaved your data are. As to this having been a commonly used methodology for such data in your topic area, my experience has been (and I appreciate that this will sound snooty) to have very little faith in most of the factor analyses I have seen: they tend to have absurdly small sample sizes, to violate basic assumptions, and to have constructed "meaningful" factors from what appears to me to be random variation. To return to the point, you might find it possible to employ polychoric correlations. See, for example, http://flash.lakeheadu.ca/~boconno2/itemanalysis.html --------------------------------------------- It's completely appropriate, and rarely done on anything else. The reviewer is talking nonsense. Useful books on factor analysis may be: Making Sense of Factor Analysis in Health Care Research: A Practical Guide http://www.amazon.co.uk/exec/obidos/ASIN/0761919503 A first course in factor analysis http://www.amazon.co.uk/exec/obidos/ASIN/0805810625/ An easy guide to factor analysis http://www.amazon.co.uk/exec/obidos/ASIN/0415094909 Papers, things like: Briggs, S. R., & Cheek, J. M. (1986). The role of factor analysis in the development and evaluation of personality scales. Journal of Personality, 54, 106-148. Journals like Personality and Individual Differences, or Journal of Personality and Social Psychology will have lots of examples of factor analysis done on Likert scales (and rarely done on anything else). ---------------------------------------------------- Yes FA is meant for interval data, however it is common practice to use FA for Likert scales. However to strengthen your results you should look into RASCH analysis, which has been used a lot in education and is starting to appear in other fields http://www.rasch-analysis.com/. You can do RASCH analysis in Stata, though I believe it is tricky to program, ideally you would need a specific package for it (RUMM). The other option is to look into confirmatory FA. Programs like lisrel and AMOS do this. AMOS is very user friendly, lisrel on the other hand is a bit more tricky. A great text is "scale development" by Robert F DeVellis. RASCH can be tricky, but once you see what it can do for scale development, you'll see why it is such a great skill to have. What about confirmatory FA? AMOS is easy to use (like SPSS) and I found the best way to learn was to replicate other people's studies. In papers that have used CFA you'll see they provide a correlation matrix, you can enter that into excel or some other spreadsheet program and then copy the model they have in the paper to see if you can re-produce the results (not always the case!). This would be easier than RASCH as you already have the model from your exploratory FA, so you would start by testing that and looking at the statistics to confirm/reject the EFA. That would certainly add validity to your results along with all the other validity tests you can use. ----------------------------------------------------------- Muthen's 'MPlus' program can perform factor analysis with ordinal data. Probably Vermunt's 'Latent Gold' software can also do it. You can try Muthen's software. There is a free download, which can do factor analysis for 6 variables, I think. It is at - http://www.statmodel.com/ The model you need is example 4.2 in the User's Guide examples, which you can get at http://www.statmodel.com/ugexcerpts.shtml. Mplus is very handy, because you can easily perform confirmatory factor Analysis and factor analysis with covariates that affect the latent variables and/or the indicator variables. You can check through the other examples to see what is possible. Alternatively, the MCMCordfactanal procedure in the "MCMCpack" package in the open-source 'R' statistical programming language can make 'Markov Chain Monte Carlo for Ordinal Data Factor Analysis Models'. You can down-load R (and MCMCPack) free from http://www.r-project.org/ -------------------------------------------- I thought the Likert-like wording was supposed to give roughly interval-scaled results. Certainly, ten years ago FA on likert scales was 'industry-standard' in the social sciences I've occasional tried correspondence analysis (one type of optimal scaling) as an alternative. You end up with a map (similar to a biplot) showing how each category/point on each scale relates. If * all the points on the scale are being used fairly often * you don't have too many scales to keep track of (coz you've got (5*number of scales) points scatterd over the map), and * you're lucky it can work out quite neatly. Michael Greenacre has written a couple of books on the technique -------------------------------------------------- IMHO, any method for doing factorial analysis using Likert scales would first treat them as ordinal, then slide that into interval scales. Whether they said as much or not, would depend on the author. That is how they do the analysis, like it or not. So my suggestion is as follows: set the middle point of the 5 point scale to 'neutral' and not to 'no response.' Treat the scale as an interval scale, and say so out loud. BTW, you can't report averages unless you do this. You can't take an average on an ordinal scale. "does not compute." Also, an interval scale means that a response halfway between 'strongly disagree' and 'disagree' means just that - 1.5 on the scale. If you have some very good data, and you really want to examine cases of averages near the extremes, you can do a conversion that 'expands' the scale near the extremes. It is related to the logit conversion, if I get my names right, and is also used by Taguchi only he calls it Omega transform. y' = ln (y-y[low])/(y[high] - y) If some y's = y[low] or y[high], adjust by shifting the extreme values slightly to get no -inf or +inf. At very low values of y this goes to - infinity. at very high values it goes to + infinity. Whether your data is strong enough (low measurement variance) to support such games, is up to you. As for results: If you wish to draw a medical conclusion, then a Likert scale survey is not your cup of tea anyway, but the ordinal- interval conversion may not fly past the reviewers. If you wish to learn trends and directions and you are not concerned whether the result has a p value of 1.5 or 0.9, but only whether p< 0.1, then this will get you home. It has for me a couple times. ------------------------------------------------------ To be blunt, I think that the reviewer is grossly misinformed. One can do factor analysis with any "type" of data, but special estimation procedures may be required. For ordered categorical data with about 5-7 or more categories, and with the data roughly symmetrically distributed across those categories, maximum likelihood estimation (mle) generally works ok. With fewer categories and/ or non-symmetrically distributed data (e.g., floor effects), special estimation procedures are required to obtain good standard errors and model fit statistics. (Simulation research suggests that the parameter estimates obtained from mle are quite robust, so you might be able to get away with using mle with bootstrapped confidence intervals.) You can find a bunch of references on this topic on these two websites: http://www.upa.pdx.edu/IOA/newsom/semrefs.htm http://www.statmodel.com/references.shtml More generally, on the topic of "types" of data "required" for certain statistical analyses, see Lord, F. (1953). On the statistical treatment of football numbers. American Psychologist, 8(12), 750-751. Lord, F. (1954). Further comment on "football numbers." American Psychologist, 9(6), 264-265. Velleman, P. F., & Wilkinson, L. (1993). Nominal, ordinal, interval, and ratio typologies are misleading. The American Statistician, 47(1), 65-72. -- Dr G.S.Clarke Lecturer in Physiology & Biometry Faculty of Health Studies University of Wales, Bangor Fron Heulog Ffriddoedd Road Bangor Gwynedd LL57 2EF Tel: 01248 383157 e-mail: [log in to unmask] -- Gall y neges e-bost hon, ac unrhyw atodiadau a anfonwyd gyda hi, gynnwys deunydd cyfrinachol ac wedi eu bwriadu i'w defnyddio'n unig gan y sawl y cawsant eu cyfeirio ato (atynt). Os ydych wedi derbyn y neges e-bost hon trwy gamgymeriad, rhowch wybod i'r anfonwr ar unwaith a dilëwch y neges. Os na fwriadwyd anfon y neges atoch chi, rhaid i chi beidio â defnyddio, cadw neu ddatgelu unrhyw wybodaeth a gynhwysir ynddi. Mae unrhyw farn neu safbwynt yn eiddo i'r sawl a'i hanfonodd yn unig ac nid yw o anghenraid yn cynrychioli barn Prifysgol Cymru, Bangor. Nid yw Prifysgol Cymru, Bangor yn gwarantu bod y neges e-bost hon neu unrhyw atodiadau yn rhydd rhag firysau neu 100% yn ddiogel. Oni bai fod hyn wedi ei ddatgan yn uniongyrchol yn nhestun yr e-bost, nid bwriad y neges e-bost hon yw ffurfio contract rhwymol - mae rhestr o lofnodwyr awdurdodedig ar gael o Swyddfa Cyllid Prifysgol Cymru, Bangor. www.bangor.ac.uk This email and any attachments may contain confidential material and is solely for the use of the intended recipient(s). If you have received this email in error, please notify the sender immediately and delete this email. If you are not the intended recipient(s), you must not use, retain or disclose any information contained in this email. Any views or opinions are solely those of the sender and do not necessarily represent those of the University of Wales, Bangor. The University of Wales, Bangor does not guarantee that this email or any attachments are free from viruses or 100% secure. Unless expressly stated in the body of the text of the email, this email is not intended to form a binding contract - a list of authorised signatories is available from the University of Wales, Bangor Finance Office. www.bangor.ac.uk