You need to agree clearly what this means:
"I bet that your visual impression will not be better than > chance."
'Chance' has zero mean and what dispersion ?
Do all the plots need to be standardised to the same pooled sampling error ?
How big a sample to have a 'reasonable' power of rejecting the null hypothesis above ?
Looking foward to it !
Klim
|