Hi Ed, I didn't really open a can of worms... Sorry if I did! > But there are many things affecting R-factors not > being reported. Do I have to deposit the bulk solvent mask? Not that many and most of them are reported (at least those that may affect the R-factor "significantly"). The model structure factor is (well different programs may define it differently, but they normally report the corresponding parameters) (as used in phenix.refine): Fmodel = scale_overall * exp(-h*U_overall*ht) * (Fcalc + k_sol * exp(-B_sol*s^2/4) * Fmask) - anisotropic scale matrix (U_overall) is reported as "REMARK 3 B11 :", ...; - ATOM and ANISOU is enough to calculate Fcalc (the difference between scattering tables used: it1992, wk1995 or n_gaussian will not make visible difference) (the difference in algorithm FFT vs DIRECT used to compute Fcalc will not, normally, make difference too); - Bulk solvent k_sol and B_sol are reported: "REMARK 3 K_SOL" or similar; - Reproducing Fmask you at least need "grid_step", "solvent radius" and "shrink truncation radius", which phenix.refine reports all: REMARK 3 BULK SOLVENT MODELLING. REMARK 3 METHOD USED : FLAT BULK SOLVENT MODEL REMARK 3 SOLVENT RADIUS : 1.11 REMARK 3 SHRINKAGE RADIUS : 0.90 REMARK 3 K_SOL : 0.347 REMARK 3 B_SOL : 29.392 So I think you have all (or most of) you need to reproduce the R-factor statistics IF PDB FILE IS COMPLETE. > I think > the fundamental question is why do you need to be able to reproduce the > exact R-factors. Perhaps depositing FC and PHIC along with FP and SIGFP > will solve your problem? > Given so little information and effort to do so, I don't see why not? It's just a simple formula that relies on a number of simple parameters that are easy to report and in fact they are reported in PDB file in most of cases. If you have a few atoms model, you could use a pen and calculator to do so -:) It's a great and simplest validation tool: if you unable to reproduce the R-factors then something is wrong with the 1) file, 2) structure or 3) your software. However, if you explicitly allow the R-factors to be not ("exactly") reproducible, then you immediately loose the way to address "1)-3)". > Don't get me wrong - I think it's important that deposited structures > provide complete information about the model. But why are riding > hydrogens so particularly important when reporting crystallization > conditions is not mandatory? Or bulk solvent parameters? Or geometry > restraints you used for the custom ligand (thus it's not in the standard > libraries)? > Absolutely agree! So, let's make a tiny step forward and keep whatever we can easily keep, and hope that in future more information will be preserved (such as complete foot-print of restraints used and so on). > I did a quick (and dirty) survey of the PDB and found that less than 2% > of structures report hydrogens. Well, years back people used to cut low resolution data at 5...8A until it became clear that it's a not so good idea. Nowadays, I don't think anyone will throw away low resolution data. Similar rationale with hydrogens. Let's follow the progress and not look back (telling myself) -:) All the best! Pavel.