JISCMail - ALLSTAT Archives

Hi,

a number of allstaters have asked me for a copy of the replies I 
received to my question on Factor Analysis - please find this below. 
Many thanks to all those who took the time to reply. As you can see 
below there was a range of opinions on the approach to analysing likert 
data.

Best Wishes to you all

Graham

Original Question:
I have undertaken some factor analysis on a set of data derived from 
Likert type (5 point ordinal) questions. Other papers on the topic in 
question have used FA however a reviewer has pointed out FA requires the 
data to be at least interval in nature.

What do members of the list feel - is it ever acceptable to use FA on 
likert scale variables (if so when, is there any diagnostic tests?) any 
references to acceptability of use would be most appreciated.

If not acceptable what would be the prefered way of undertaking a FA 
type analysis on ordinal data?  I've heard of 'optimal scaling' though 
I've never used it.
--------------------------------------------------
Responses – truncated in places for brevity:

I'd agree with your reviewer, though to an extent it depends on how 
well-behaved your data are. As to this having been a commonly used 
methodology for such data in your topic area, my experience has been 
(and I appreciate that this will sound snooty) to have very little faith 
in most of the factor analyses I have seen: they tend to have absurdly 
small sample sizes, to violate basic assumptions, and to have 
constructed "meaningful" factors from what appears to me to be random 
variation. To return to the point, you might find it possible to employ 
polychoric correlations. See, for example,
http://flash.lakeheadu.ca/~boconno2/itemanalysis.html
---------------------------------------------
It's completely appropriate, and rarely done on anything else. The 
reviewer is talking nonsense.  Useful books on factor analysis may be:
Making Sense of Factor Analysis in Health Care Research: A Practical Guide
http://www.amazon.co.uk/exec/obidos/ASIN/0761919503
A first course in factor analysis
http://www.amazon.co.uk/exec/obidos/ASIN/0805810625/
An easy guide to factor analysis
http://www.amazon.co.uk/exec/obidos/ASIN/0415094909

Papers, things like:
Briggs, S. R., & Cheek, J. M. (1986). The role of factor analysis in the 
development and evaluation of personality scales. Journal of 
Personality, 54, 106-148.

Journals like Personality and Individual Differences, or Journal of 
Personality and Social Psychology will have lots of examples of factor 
analysis done on Likert scales (and rarely done on anything else).
----------------------------------------------------
Yes FA is meant for interval data, however it is common practice to use 
FA for Likert scales. However to strengthen your results you should look 
into RASCH analysis, which has been used a lot in education and is 
starting to appear in other fields http://www.rasch-analysis.com/. You 
can do RASCH analysis in Stata, though I believe it is tricky to 
program, ideally you would need a specific package for it (RUMM). The 
other option is to look into confirmatory FA. Programs like lisrel and 
AMOS do this. AMOS is very user friendly, lisrel on the other hand is a 
bit more tricky.  A great text is "scale development" by Robert F DeVellis.

RASCH can be tricky, but once you see what it can do for scale 
development, you'll see why it is such a great skill to have.

What about confirmatory FA? AMOS is easy to use (like SPSS) and I found 
the best way to learn was to replicate other people's studies. In papers 
that have used CFA you'll see they provide a correlation matrix, you can 
enter that into excel or some other spreadsheet program and then copy 
the model they have in the paper to see if you can re-produce the 
results (not always the case!). This would be easier than RASCH as you 
already have the model from your exploratory FA, so you would start by 
testing that and looking at the statistics to confirm/reject the EFA. 
That would certainly add validity to your results along with all the 
other validity tests you can use.
-----------------------------------------------------------
Muthen's 'MPlus' program can perform factor analysis with ordinal data.
Probably Vermunt's 'Latent Gold' software can also do it.

You can try Muthen's software. There is a free download, which can do
factor analysis for 6 variables, I think. It is at -
http://www.statmodel.com/
The model you need is example 4.2 in the User's Guide examples, which 
you can get at http://www.statmodel.com/ugexcerpts.shtml.

Mplus is very handy, because you can easily perform confirmatory factor
Analysis and factor analysis with covariates that affect the latent 
variables and/or the indicator variables. You can check through the 
other examples to see what is possible.

Alternatively, the MCMCordfactanal procedure in the "MCMCpack" package
in the open-source 'R' statistical programming language can make 'Markov
Chain Monte Carlo for Ordinal Data Factor Analysis Models'. You can 
down-load R (and MCMCPack) free from http://www.r-project.org/
--------------------------------------------
I thought the Likert-like wording was supposed to give roughly 
interval-scaled results.  Certainly, ten years ago FA on likert scales 
was 'industry-standard' in the social sciences

I've occasional tried correspondence analysis (one type of optimal 
scaling) as an alternative.  You end up with a map (similar to a biplot) 
showing how each category/point on each scale relates.  If
* all the points on the scale are being used fairly often
* you don't have too many scales to keep track of (coz you've got 
(5*number of scales) points scatterd over the map), and
* you're lucky
it can work out quite neatly.
Michael Greenacre has written a couple of books on the technique
--------------------------------------------------
IMHO, any method for doing factorial analysis using Likert scales  would 
first treat them as ordinal, then slide that into interval  scales. 
Whether they said as much or not, would depend on the  author.  That is 
how they do the analysis, like it or not.

So my suggestion is as follows:  set the middle point of the 5 point 
scale to 'neutral' and not to 'no response.'  Treat the scale as an 
interval scale, and say so out loud.  BTW, you can't report averages 
unless you do this.  You can't take an average on an ordinal scale. 
"does not compute."

Also, an interval scale means that a response halfway between  'strongly 
disagree' and 'disagree' means just that - 1.5 on the scale.

If you have some very good data, and you really want to examine cases 
of averages near the extremes, you can do a conversion that 'expands' 
the scale near the extremes.  It is related to the logit conversion,  if 
I get my names right, and is also used by Taguchi only he calls it 
Omega transform.

y' = ln (y-y[low])/(y[high] - y)  If some y's = y[low] or y[high], 
adjust by shifting the extreme values slightly to get no -inf or +inf.

At very low values of y this goes to - infinity.  at very high values 
it goes to + infinity.

Whether your data is strong enough (low measurement variance) to 
support such games, is up to you.

As for results:  If you wish to draw a medical conclusion, then a 
Likert scale survey is not your cup of tea anyway, but the ordinal- 
interval conversion may not fly past the reviewers.

If you wish to learn trends and directions and you are not concerned 
whether the result has  a p value of 1.5 or 0.9, but only whether p< 
0.1, then this will get you home.  It has for me a couple times.
------------------------------------------------------
To be blunt, I think that the reviewer is grossly misinformed. One can 
do factor analysis with any "type" of data, but special estimation 
procedures may be required. For ordered categorical data with about 5-7 
or more categories, and with the data roughly symmetrically distributed 
across those categories, maximum likelihood estimation (mle) generally 
works ok. With fewer categories and/ or non-symmetrically distributed 
data (e.g., floor effects), special estimation procedures are required 
to obtain good standard errors and model fit statistics. (Simulation 
research suggests that the parameter estimates obtained from mle are 
quite robust, so you might be able to get away with using mle with 
bootstrapped confidence intervals.) You can find a bunch of references 
on this topic on these two websites:

http://www.upa.pdx.edu/IOA/newsom/semrefs.htm
http://www.statmodel.com/references.shtml

More generally, on the topic of "types" of data "required" for certain 
statistical analyses, see

     Lord, F. (1953). On the statistical treatment of football numbers. 
American Psychologist, 8(12), 750-751.
     Lord, F. (1954). Further comment on "football numbers." American 
Psychologist, 9(6), 264-265.
     Velleman, P. F., & Wilkinson, L. (1993). Nominal, ordinal, 
interval, and ratio typologies are misleading. The American 
Statistician, 47(1), 65-72.

-- 
Dr G.S.Clarke
Lecturer in Physiology & Biometry
Faculty of Health Studies
University of Wales, Bangor
Fron Heulog
Ffriddoedd Road
Bangor
Gwynedd LL57 2EF

Tel: 	01248 383157
e-mail: [log in to unmask]




-- 
Gall y neges e-bost hon, ac unrhyw atodiadau a anfonwyd gyda hi,
gynnwys deunydd cyfrinachol ac wedi eu bwriadu i'w defnyddio'n unig
gan y sawl y cawsant eu cyfeirio ato (atynt). Os ydych wedi derbyn y
neges e-bost hon trwy gamgymeriad, rhowch wybod i'r anfonwr ar
unwaith a dilëwch y neges. Os na fwriadwyd anfon y neges atoch chi,
rhaid i chi beidio â defnyddio, cadw neu ddatgelu unrhyw wybodaeth a
gynhwysir ynddi. Mae unrhyw farn neu safbwynt yn eiddo i'r sawl a'i
hanfonodd yn unig  ac nid yw o anghenraid yn cynrychioli barn
Prifysgol Cymru, Bangor. Nid yw Prifysgol Cymru, Bangor yn gwarantu
bod y neges e-bost hon neu unrhyw atodiadau yn rhydd rhag firysau neu
100% yn ddiogel. Oni bai fod hyn wedi ei ddatgan yn uniongyrchol yn
nhestun yr e-bost, nid bwriad y neges e-bost hon yw ffurfio contract
rhwymol - mae rhestr o lofnodwyr awdurdodedig ar gael o Swyddfa
Cyllid Prifysgol Cymru, Bangor.  www.bangor.ac.uk

This email and any attachments may contain confidential material and
is solely for the use of the intended recipient(s).  If you have
received this email in error, please notify the sender immediately
and delete this email.  If you are not the intended recipient(s), you
must not use, retain or disclose any information contained in this
email.  Any views or opinions are solely those of the sender and do
not necessarily represent those of the University of Wales, Bangor.
The University of Wales, Bangor does not guarantee that this email or
any attachments are free from viruses or 100% secure.  Unless
expressly stated in the body of the text of the email, this email is
not intended to form a binding contract - a list of authorised
signatories is available from the University of Wales, Bangor Finance
Office.  www.bangor.ac.uk