Dear all,
I am using a very large dataset (N=1.3m) to analyse
the effect of a categorical variable on gestational
age at delivery (continuous variable). I used linear
regression
(regress in stata) but as you would probably expect
the residuals are not normally distributed. I used
qreg in stata but the results do not look right
(probably there is a convergence problem).
Few months ago someone suggested that if we have a
large dataset we can use linear regression even if the
residuals are not normally distributed. Is this right?
If yes is there any reference?
Also, I could not find a meaningful transformation for
gestational age and cox regression would not work
because I need the effect of exposure on continuous
gestational age.
Any help is highly appreciated
Best Wishes
Ali
___________________________________________________________
The all-new Yahoo! Mail goes wherever you go - free your email address from your Internet provider. http://uk.docs.yahoo.com/nowyoucan.html
|