Print

Print


Hi Anderson,

Thanks! That clarifies alot!
I have now applied the first design in Glm:

Group = groups (25 ones, 25 twos)
EV1 = group1 (25 ones, 25 zeros)
EV2 = group2 (25 zeros, 25 ones)
EV3 = nuisance variable (50 age values, demeaned)

However, probably due to the fact that I do not split the age variable in two groups, I get the following warning: 
'Problem with processing the model: Warning - design matrix uses different groups (for different variances), but these do not contain "separable" EVs for the different groups (...)'
I expect to get this warning also in the design with the behavioral measurements, since I do not split age there either, but do define two groups in the 'group' EV.

Can I ignore this message, since the age variable is nuisance? Or should I use the groups file to define exchangeability blocks? (Until now I have never done that..)

Best,
Z.


On Mon, Feb 23, 2015 at 9:01 AM, Anderson M. Winkler <[log in to unmask]> wrote:
Hi Zsuzsi,

I take that you meant to send this to the list, so I'm forwarding. Please, see below:


On 22 February 2015 at 11:40, Z. Sjoerds <[log in to unmask]> wrote:
Hi Anderson!

Thanks so much for your help! It seems a struggle to me, to fully comprehend why you make certain choices for a design matrix. But I do want to fully comprehend, si I am sure I do the right thing in the future.

An issue that is left for me, is what you say in the beginning of your last email: all EVs that are set to 0 are nuisance variables. This is what I know also from the SPM design matrices. Statistically this means that the variance explained by these variables is left out ('regressed out') in your test. But when I compare the two groups on FA value (independent of behavioral outcome; in principle a whole different question), I don't want any variance explained by the behavioral measurements to be taken out. I might miss some important group differences due to the fact that one of the behavioral measurements differs significantly between groups. At this level of hypothesizing, I don't want anything to do with the behavioral measurements (yet). Only in the later step, when I regress FA value with behavior, I could isolate a possible group*behav effect.

With 50 subjects, these 4 EVs will hardly become an issue, so I don't think there's a need for multiple models. But if you note that the EVs with the behavioural variable are not significant, and if it isn't anyway a sensible hypothesis or evidence that they'd somehow be associated with FA, you can remove them from the model and run just a 2 group comparison (presumably with age, as you mentioned).
 
So therefore I thought that I need two design matrices: one for the simple FA group comparison:

Group = group (25 ones, 25 twos)
EV1 = group1 (25 ones, 25 zeros)
EV2 = group2 (25 zeros, 25 ones)
EV3 = nuisance variable (age, demeaned)

Then the contrasts of interest would be:
C1 = F-contrast: 1 -1 (main effect of group)
C2 = T-contrast: 1 -1 (post-hoc group1 > group2)
C3 = T-contrast: -1 1 (post-hoc group2 > group1)

This way I actually regress out the only nuisance variable of interest (and no additional variables that are not nuisance to me)
Then I will run the Two-Sample t-test:

randomise -i AllFAmerged4D -o TwoSampT -d design.mat -t design.com -mask FAthresholdMask -x --T2

There's no need for the F-test, although if you use randomise as shown, it won't make any difference because you didn't use the "-f design.fts". If the reason for the F-test is to account for multiple testing, you can use Bonferroni (0.05/2), It won't be conservative in this case.

Also, variables that aren't in the contrast are nuisance, regardless of them being in fact interesting in some other contrasts (or in any other way) or not. And in general, if a variable may affect the response variable (or is known to affect), it can be kept, even if not significant, as it absorbs some of the variance and makes the test for the other variables more powerful.
 

(Importantly: The 4D file is ordered according to participant number, not according to group. But ofcourse in my design I kept the same order, so e.g. for the first columns group: 1 2 1 1 2 2 1 2 2 1 1 1 2 2 1 2 2 1 1 2 etc.. and the same order for EV1 and EV2; I assume that is fine?)

The order of the rows don't matter as long as they match the 4D file.
 

Second, I will make a design matrix for the second, behavioral-related question (and then defining both behav measurements in one design):

Group = group (25 ones, 25 twos)
EV1 = group 1 * behav 1
EV2 = group 2 * behav 1
EV3 = group 1 * behav 2
EV4 = group 2 * behav 2
EV5 = nuisance variable (age, demeaned)
***So, do I need an intercept here???*** or do I need to define the two groups by two extra EVs here anyways?

Must include the groups.
 

And then the contrasts of interest:

C1 = T-contrast: 1 1 0 0 0 (main positive effect of behav1, independent of group)
C2 = T-contrast: -1 -1 0 0 0 (main negative effect of behav1, independent of group)
C3 = T-contrast: 0 0 1 1 0 (main positive effect of behav2, independent of group)
C4 = T-contrast: 0 0 -1 -1 0 (main negative effect of behav2, independent of group)
C5 = F-contrast: 1 -1 0 0 0 (group interaction test on behav1)
C6 = T-contrast: 1 -1 0 0 0 (post-hoc t-test group1*behav1 > group2 *behav1)
C7 = T-contrast: -1 1 0 0 0 (post-hoc t-test group1*behav1 < group2 *behav1)
C8 = F-contrast: 1 -1 0 0 0 (group interaction test on behav2)
C9 = T-contrast: 1 -1 0 0 0 (post-hoc t-test group1*behav2 > group2 *behav2)
C10 = T-contrast: -1 1 0 0 0 (post-hoc t-test group1*behav2 < group2 *behav2)

Is this correct??

I think this was answered in an earlier email, just check that...
No need for these F-tests, just use Bonferroni.

 
Sorry if I keep on repeating myself, but I am afraid I have been doing things wrong until now, and want to make 1000% sure that I do it correctly now, and also understand it independently, so I don't have to bother you in the future anymore ;-)
By the way, the wiki for Glm / Randomise is offline from the fmrib site. Is there a reason for that? I tried to find some answers there, but that didn't work out..

It should be back now.

All the best,

Anderson


 

Thanks again; hope you can help comprehend the last issues!

Best,
Zsuzsi



On Sat, Feb 21, 2015 at 6:21 PM, Anderson M. Winkler <[log in to unmask]> wrote:
Hi Zsuzsi,

Please see below:

On 21 February 2015 at 14:56, Z. Sjoerds <[log in to unmask]> wrote:
Hi Anderson,

Thanks for your quick reply!
I am slightly surprised that I can put all EVs and contrasts in one design; always understood to make different designs for different questions, especially when it involves two different statistical tests: 2-sample t-test for FA comparison between groups, versus multiple regression for group comparisons on slope with a continuous variable (e.g. behavioral measure). 

It's all the same -- all particular cases of the same model.

 
So, how does that work then, for the fact that age is nuisance here, but the behav EVs are not, especially in the case of 'simple' t-test group comparison of FA, irrespective of behavioral measurement?

Each contrast is tested separately, and for each contrast, the EVs marked as 0 are nuisance. The others (non-zero) are effects of interest.

 
Isn't there then also somehow a correction for the behav scores, as is for age, and since the groups differ in one of the behav scores, don't I then remove an important part of the explained variance between the groups (in the simple FA 2-sample t-test)?

Given behav1 and behav2 are correlated, it's impossible to disambiguate them, and as coded, each test with check the unique contribution of the respective scores. It may not be what you want, hence the suggestion to collapse both scores with PCA. I think you mentioned that the correlation is high, and in this case, another possibility is simply ignore one of them (that is, remove either behav 1 or behav 2 it from the design altogether).

 
And I assume with C5 and C6 you don't mean an interaction? Because my first, main question regards the simple FA comparison between groups. The whole-sample (two groups pooled) regression with the behavs and group*behav interaction are part of my second question.

Yes, sorry, the C5 and C6 are not for interactions, just group differences:

C5: [1 -1 0 0 0 0 0]: (group 1 > group 2)
C6: [-1 1 0 0 0 0 0]: (group 1 < group 2)

 
Shouldn't there also be contrasts that look at whole-sample regression with behav? e.g. 0 0 1 1 0 0  (positive association behav 1), 0 0 -1 -1 0 0 (negative association with behav 1), 0 0 0 0 1 1 (positive association with behav 2), 0 0 0 0 -1 -1 (negative association with behav 2)?

If you want, yes, but if the interaction is significant, these contrasts won't be useful. And if the interaction isn't significant, a single EV for behav (not split between groups) is more appropriate.

 
And I assume demeaning should be done based on the whole-sample mean? Not the mean per group? I keep on getting confused on that..

Yes.
 

Last question: I see you did not add the intercept in the design (e.g. EV1 all 1's); so, when exactly do you have to add it (and how does it or does it not influence the need for demeaning)? I did add it in my design before; is that a flaw? Did this likely influence my results? Is there documentation on the role of the intercept here, and when to use it, and when not, so I can also decide in future studies how to set up my design matrix?

The intercept codes the overall mean. Here you want to compare the means of each group, so the intercept is split across both groups, which are tested with C5 and C6.
The intercept can be modelled as a single EV, but this entails modifying other EVs.

All the best,

Anderson

 

On Sat, Feb 21, 2015 at 10:45 AM, Anderson M. Winkler <[log in to unmask]> wrote:
Hi Zsuzsi,

For these hypotheses, you need in the design matrix:

EV1: group 1
EV2: group 2
EV3: group 1 * behav 1
EV4: group 2 * behav 1
EV5: group 1 * behav 2
EV6: group 2 * behav 2
EV7: age

Mean-center behav 1 and behav 2 before computing the interaction EVs.

The contrasts will be:
C1: [0 0 1 -1 0 0 0]: interaction group vs behav 1
C2: [0 0 -1 1 0 0 0]: interaction group vs behav 1 (opposite sign)
C3: [0 0 0 0 1 -1 0]: interaction group vs behav 2
C4: [0 0 0 0 -1 1 0]: interaction group vs behav 2 (opposite sign)
C5: [1 -1 0 0 0 0 0]: interaction group vs behav 1 (group 1 > group 2)
C6: [-1 1 0 0 0 0 0]: interaction group vs behav 1 (group 1 < group 2)

All these hypotheses can be done with t-tests, so no need for F-tests. The 6 contrasts mean multiple testing. You can use Bonferroni over the 6 tests, so the significance level becomes 0.05/6 = 0.00833. It's not as conservative as it may appear, because the positive and negative versions can be treated as independent with data as you have.

Still, if you want, you can run an F-test comprising C1, C3 and C5. The result of this test is just a shield to minimise multiple testing, as the interpretability is a bit difficult given that the respective t-tests test entirely different things.

The above ignores the fact that behav 1 and behav 2 are correlated. Maybe you can consider using PCA to derive just 1 new behavioural score that captures the variance of these two.

All the best,

Anderson


On 20 February 2015 at 12:11, Zsuzsika Sjoerds <[log in to unmask]> wrote:
Dear Anderson & FSL list,

Following recent posts on the design matrix for e.g. randomise, I started doubting about my own glm approach. I am originally trained with the SPM gui, where one can build contrasts on the go (e.g. define post-hoc t-contrasts only after seeing an interaction effect in F-contrast). For the FSL/glm approach, all contrasts need to be built beforehand (also post-hoc t-contrasts) to run at once in randomise, so an extra amount of forward-planning is needed.
I read about several approaches that I had not applied so far. I hope someone could help figure out what the best GLM design would be for my analyses.

I have a sample of 50 participants, split in two even groups (group1, group2). First I want to compare the groups on FA values, as obtained by TBSS, and second I want to regress the FA values (whole sample & group interaction) with two behavioral measurements (behav1, behav2). These two behavioral measurements negatively correlate with each other, and both also correlate with age (one shows a positive, the other shows a negative correlation). Therefore I want to add age as a nuisance variable, to make sure I don't look at age effects.

For my study I have the following 5 research questions:
- group difference on FA, irrespective of behavioral measurements
- (whole sample) association with behav1
- group * behav1 interaction (and post-hoc t-test, if the interaction is significant)
- (whole sample) association with behav2
- group * behav2 interaction (and post-hoc t-test, if the interaction is significant)

eventually I want to apply this same approach on tractography data, so it would be good to have things straight on a correct design at this point.

Until now I have created 5 separate design.mat files for the 5 questions above (mainly due to the order in which I explored my dataset and tried out designs in the beginning), but I can imagine this is not optimal due to multiple comparisons, and degrees of freedom?


Therefore my first question: can I create an optimal glm design that combines (several of) these questions?

For instance 3 designs:
1. main group comparison, irrespective of behavioral measurement
2. behav1, with EVs and contrasts exploring both a main effect of behav1, and a group interaction
3. behav2, with EVs and contrasts exploring both a main effect of behav1, and a group interaction

However, in a recent post I read that the main group difference (irrespective of behavioral measurement) and the behavioral regression could be entered in one design, together with the nuisance variable (although now behavior * group interaction was studied in this post, as far as I could see)
 (https://www.jiscmail.ac.uk/cgi-bin/webadmin?A2=ind1502&L=FSL&F=&S=&X=0911D30C4E89A0952E&Y=sjoerds.zs%40gmail.com&P=234382).
So then for me this would look like:
(EV1 = intercept? 50 ones??)
EV2 = group1 (25 ones, 25 zeros)
EV3 = group2 (25 zeros, 25 ones)
EV4 = nuisance (50 demeaned values based on whole sample mean)
EV5 = group1 behav (25 demeaned values - based on whole sample mean? 25 zeros for group2)
EV6 = group2 behav (25 demeaned values - based on whole sample mean? 25 zeros for group1)

In that case it was suggested to simply make the F-contrast 0 1 -1 0 0 0 for group differences, and two t-contrasts for possible post-hoc tests:
0 1 -1 0 0 0
0 -1 1 0 0 0

And then I assume, the whole-sample regression with behav would be contrasted as:

t-contrast 0 0 0 0 1 1 (for a positive association)
and
t-contrast 0 0 0 0 -1 -1 (for a positive association)

and group-interaction on behav would be:
f-contrast 0 0 0 0 1 -1 (plus respective t-contrasts if f-contrast shows significance)

However, not taking the two behav EVs (of interest) into account in the first contrast (0 1 -1 0 0 0), I assume that these behav EVs are also considered nuisance variables, and therefore they influence the explained variance between the groups? This is the reason why I earlier built separate GLMs for the non-behavior related group differences, versus regression with behavior. But do I understand correctly now that it is also fine to combine these EVs in one design? How does randomise see the difference between nuisance variables and variables of interest then?
I do assume that the regressions with the two different behavioral measurements should however be defined in two separate designs, especially because of their colinearity. But if I combine pure (non-behavior related) group difference EVs with behavioral EVs, I have the same first contrast (0 1 -1 0 0 0) in both designs (for behav1 and behav2).. hence my confusion.


One other important question: I have manually demeaned both my behav and nuisance (age) parameters. Therefore I don't add the -D in the command in randomise, but did add an intercept as first EV (filed with ones, and all following EVs move a number). But now I read that it was adviced against it?
So, in my case: do I need to add an intercept, and how do I handle demeaning??

Thanks in advance!
Best,
Zsuzsi

--
Z. Sjoerds, PhD
Postdoctoral researcher

Max Planck Institute for Human Cognitive and Brain Sciences
Fellow-Group Cognitive and Affective Control of Behavioral Adaptation
Group Schlagenhauf, Room C211
Stephanstraβe 1A
04103 Leipzig
Germany

[T]: +49 (0) 341 9940 2471
[F]: +49 (0) 341 9940 2499
[E]: [log in to unmask] / [log in to unmask]




--
Z. Sjoerds, PhD
Postdoctoral researcher

Max Planck Institute for Human Cognitive and Brain Sciences
Fellow-Group Cognitive and Affective Control of Behavioral Adaptation
Group Schlagenhauf, Room C211
Stephanstraβe 1A
04103 Leipzig
Germany





--
Z. Sjoerds, PhD
Postdoctoral researcher

Max Planck Institute for Human Cognitive and Brain Sciences
Fellow-Group Cognitive and Affective Control of Behavioral Adaptation
Group Schlagenhauf, Room C211
Stephanstraβe 1A
04103 Leipzig
Germany





--
Z. Sjoerds, PhD
Postdoctoral researcher

Max Planck Institute for Human Cognitive and Brain Sciences
Fellow-Group Cognitive and Affective Control of Behavioral Adaptation
Group Schlagenhauf, Room C211
Stephanstraβe 1A
04103 Leipzig
Germany

[T]: +49 (0) 341 9940 2471
[F]: +49 (0) 341 9940 2499
[E]: [log in to unmask] / [log in to unmask]