Thank you very much to all those who offered suggestions in response to my posting 'Evaluation of interactive effects when main effects are not significant'. I have not been able to reply to all of you individually but the response was very encouraging and your contributions were appreciated. A few individuals expressed an interest in having your suggestions circulated. I have therefore endeavoured to provide a summary below. I have also included my original message (with a clarification included on what I meant by significant). I shall try to read these suggestions collectively as each has a special contribution to make to the discussion. Please feel free to get back to me if you have any further relevant comments to offer. Best wishes Margaret *********************************************************************** Original message: Dear Statisticians When building a multivariate model to test for associations between an outcome variable and multiple factors, I tend to avoid testing for a significant interactive effect between two factors if I have already found that the main effects for the corresponding factors are not statistically significant. [I SHOULD ADD HERE THAT WHEN PROCEEDING IN THIS WAY, I USE A VERY HIGH SIGNIFICANCE LEVEL TO TEST FOR MAIN EFFECTS AT THIS SCREENING STAGE.] Nevertheless, I am conscious that there is some lack of consensus amongst statisticians as to whether this is an appropriate approach to take. I would be interested therefore to receive individual views (together with explanations) as to whether or not the above approach is justifiable. The above issue is strongly related to the question of what to conclude if an interactive effect proves to be significant in a multivariate analysis when neither of the main effects were significant. I would therefore very much welcome some suggestions here too! Many thanks Best wishes Margaret Response 1 =========== I could easily see a reason to test for interactive effects even when main effects are not significant. Take for example motor insurance and a model that tries to predict who will have an accident. Consider and extreme example to make a point. factor 1: two groups of cars one with the break pedal at a normal distance from the driving seat and another car designed for people with long legs. factor 2: height - small and tall - splitting the population 50%/50% If we further assume that cars are dished out randomly between tall and small people, then of course there will be lots of accidents, however you will spot nothing when just looking at the main factors. For example half the small people crash due to being in tall peoples cars and vice versa. Likewise half of each lot of cars crash due to being driven by people with the wrong length legs! However the interactive effect will ofcourse tell everything! Slightly exaggerated but I could imagine this in real life too. I'd be interested to hear what field you are using these models in. Response 2 ========== Here's an example that convinced me... A characteristic is measured at baseline; subjects are then randomized to treatment or control and subsequently the characteristic is measured again at a second timepoint. It would not be expected that there would be a significant difference between the groups at baseline, nor between the first and second timepoints for the control subjects. But if the treatment has an effect on the measured characteristic, then (assuming the sample size is large enough) it would be expected to find a significant treatment-timepoint interaction. Interestingly, this is a study design that is often found, but the old adage about not testing for interactions if the main effects are not significant still prevails. ============================================================= Response 3 =========== A possible approach to investigate here is the method of multi-factor dimensionality reduction, which has its own web page at http://www.multifactordimensionalityreduction.org/ and seems to be getting increasingly popular in the genetics sector (rightly or wrongly). _____________________________________________________________________ Response 4 =========== As you probably know, fitting an interaction without a main effect violates Nelder's marginality principle. 1. Nelder JA. A reformulation of linear models. Journal of the Royal Statistical Society A 1977;140:48-77. In commonsense terms I would say that one should not confuse importance with significance. A main effect may be relatively small but it is difficult to imagine it could be exactly zero if an interaction is present. Either leaving the main effect in will make little difference to inference about the interaction, in which case what's the problem, or it will, in which case it is unwise to leave it out. You may find the attached of interest. See in particular section 2.2. [The reference for the attached article is: Seen, S., “Added Values (2004): Controversies concerning randomization and additivity”, Statistics in Medicine, 23: 3729 – 3753.] Response 5 ========== As a statistician with ample experience in psychology and medicine, my advice is to first test interaction, and only if interaction is absent, have a look at main effects. For instance, in comparing psychotherapy with pharmacotherapy for treating depression, it may turn out that averaged across severely and mildly depressed patients, there is no therapy effect (main effect), whereas psycho is better than pharma for mild depression and worse than pharma for severe depression. Main effects obscure such clinically important interactions, although I concede that they may be rare. In unbalanced designs or with other coding schemes than -1,+1 (e.g. if we use regression with(0,1) coding of predictors), things are even worse: the p-values of the main effects are then arbitrary (i.e. dependent on the coding used) as long as the interaction term is in the model. In contrast, the p of the interaction term itself is invariant across coding schemes as long as both main effects are included into the model. You may easily verify this on some dataset, by using (0,1) coding and (-1,+1) coding and comparing p-values (continued) P.s. I realized somewhat late that your context may be different from mine. With multiple factors in a factorial design, all of which are of interest, I guess you may often be forced to start with main effects, except for a few interactions based on prior theory. In that case, testing for interactions only between those main effects which were significant, makes sense in order to prevent highly complicated models. In our context we typically have one factor of interest, e.g. treatment, and a series of 5 up to 10 covariates included for increasing power or testing interaction with treatment. In that case, we usually start with a model including all treatment by covariate effects, but not covariate by covariate effects, and then reduce the model by deleting non-significant interactions. So it depends on the number of factors of interest whether starting with main effects only is better, or starting with main and (twoway) interactions, I guess. Response 6 ========== I was unaware of any lack of concensus. It is quite possible for two variables (x1, X2) to be unrelated to a third (Y), but still have their interaction (x1*x2) be related to Y. Therefore, you cannot judge the presence or absence of an interaction based on the model without interactions. I have attached sas code and results which demonstrate this. Also this is a multivariable analysis, not multivariate, which refers to multiple dependent variables. Response 7 ========== It is generally accepted practice not to include an interaction in a model unless the main effects are also present. However, that is not to say that the main effects need to be significant. If you think that an interaction is important, then if it is significant, it should be included in the model. By my reasoning, if you do this, then the main effects should be included also, whether or not they are significant. At first sight, this might seem strange but it is just what we do with the mean and the main effects. If we think that main effects are important and they are also sifnificant, then we usually include a mean in the model irrespective of whether or not it is significantly different from zero. The reason that we do not usually have to ponder over this is that statistical software usually fits a mean by default, whatever other terms are specified. Response 8 ========== 1) Consider a graphical response surface, a 2-factor, 2-level one for simplicity, in which the main effect of a (b-) is positive in a, while the main effect of a(b+) is negative in a. The overall main effect of a will be near nil - insignificant. Likely enough, same for b. I have run across real data that does this, and the graphical display is about the only way I or the client & students, could understand it. 2) Moral, it is possible that main effects will be insig, while interactions will be sig. 3) Geo. Box, verbally & I think in BH^2, says that if you have a significant interaction, then you _must_ retain the main effect terms in the model, for mathematical reasons. I've seen a case where dropping the main effects caused weirdness, but can't recall the exact conditions at the moment. So I take George's advice. 4) If I were doing what you're doing, I would let the significance of the interaction drive the choice of retention, not the other way around. Response 9 ========== I cannot understand why there is lack of consensus . Consider a 2X2 factorial (AXB) and the four treatment means A1B1, A1B2, A2B1 and A2B2. A set of coefficients defining the contrasts of means equivalent to o the main effect for factor A would be (-1 -1 1 1) o the main effect for factor B would be (-1 1 -1 1) and o for the AXB interaction is (1 -1 -1 1) So if A1B1 = A2B2 = x1 say and A1B2 = A2B1 = x2 say o the main effect for A is A2B1 + A2B2 - A1B1 - A1B2 = x2+x1 - x1 - x2 = 0 o the main effect for B is A1B2 + A2B2 -A1B1 - A2B1 = x2 + x1 - x1 - x2 = 0 o the interaction effect is A1B1 + A2B2 - A1B2 - A2B1 = x1 + x1 - x2 - x2 = 2 * (x1-x2) A simple plot of this example may help illustrate the point So you can't assume that a null main effects model is sufficient. Response 10 The decision about what effects to look for in a model depends on many aspects, and it would not be sensible to make simple rules for all occasions. Knowledge of the underlying science will often guide what is sensible. If you are modelling a well-understood mechanism in terms of factors whose association with the outcome is generally known, there is little need to look at interactions between factors whose main effects prove unimportant unless the interaction is expected. This is the general approach in confirmatory clinical trials, where it is unusual even to see analysis of interactions between factors that are statistically significant, unless one is the treatment factor. But if you have no expectation of the associations, it may well make sense to look at all interactions, as is generally done in multi-factor agricultural experiments in novel research. You need to bear in mind that statistical significance is of secondary importance. The size of the effects, and their relevance to the underlying science, should always come first. It clearly is possible that an interaction can be important (scientifically and statistically) when one or both main effects appear not to be. But the same can be said of three-factor interactions in respect of the constituent two-factor interactions, and of four-factor interactions, and so on. The second question is easier to answer. If an interaction is statistically significant, then the question of whether the main effects are significant is irrelevant. You still need to look at the scientific importance of the effects, which may be summarized in terms of main effects if the interaction effect is scientifically small, or in terms of a two-way effect. But the interaction test has answered the secondary question about the importance of both the factors from a statistical point of view. Response 11 …it's not inconceivable that an interaction is signif even if the main effects of its factors are not. Two caveats however * the appropriate test for whether the interaction significantly improves model fit is the test for the combined main effects of A and B as well as the AxB interaction, not just the interaction term * I'm happy to try this where there are not many factors/explanatories to be tested; if I were screening a lot of explanatory variables I might restrict myself to just looking at main effects initially, in the interests of sanity. Although an X-shaped interaction which is significant without showing up in the main effects is conceivable (Is it sensible in the case you're asking about?), more often I find interactions where there is some indication of an effect (maybe not significant, but with comparatively low p-values) in the main effects of one or both of the explanatories. So I guess if you are screening lots of explanatories for closer analysis, don't set the threshold for acceptance too high Response 12 Finding neither main effect to be 'significant' has no bearing on the interaction. Cell Mean A1B1 8 A1B2 6 A2B1 6 A2B2 8 with a suitable sample size and residual variance shows why. Excluding main effects ('significant' or not) when you have the interaction in the model ('significant' or not) violates marginality. That is seldom a good idea. Response 13 I sense you may have already had some replies, possibly along the same lines as this - but I was taught that main effects are not interpretable unless the interaction is non-significant - so, if there is reason to look / test for an interaction, this should be done first. I have encountered many examples over the years where the interaction was significant, and one or both of the main effects were also significant - just as I have also found many situations where the interaction was significant but neither main effect was significant. So I would wish to argue that you should reverse your policy. "Pound to a penny" that you will get replies saying the exact opposite. That is why statistics can at time be such an interesting philosophical subject. --------------------------------- New Yahoo! Mail is the ultimate force in competitive emailing. Find out more at the Yahoo! Mail Championships. Plus: play games and win prizes.