Dear Pavel,
there is a beautiful paper called 'Where freedom is given, liberties are
taken' by Kleywegt and Jones, but also a wide variety of articles that
(fortunately) fought hard for the introduction of Rfree to the
(macro-)crystallographic community.
In there is mentioned the threading of an amino acid chain backwards
into the density achieving (by refinement) a lower R-value than the
original one. Since this was achieved with refinement, the former
structure was closer to the global minimum than the latter one.
Apparently none of these authors had an idea how to modify the target
function so that this would not happen - whyfore they suggested to use
cross validation to avoid it.
If you don't like this line of thought, I can offer a different one:
there is a vast number of sets of parameters that ideally fit your data:
fill your asymmetric unit randomly with atoms so that your data to
parameter ratio is 1 or lower. Refine unrestrained and your are going to
end up with an R-value of 0. For unrestrained refinement, the formula
for the R-value corresponds (maybe not for maximum likelhood based
target functions, you may have to do some translation here) to the
target function, which usually has a lower bound of zero, hence this
vast number of "structures" all reached the global minimum. Note that
the deposited structure has an R value much greater than 0, i.e. it is
far away from the global minimum.
In order to improve the situation, one modifies the target function by
adding restraints. They increase the target value of all "structures",
but in general those for the arbitrary solutions increase so much more
than that for an acceptable solution that most of those are lifted above
that of an acceptable solution.
As an example, one of the structures for the yeast polymerase I contains
about 34,500 atoms, i.e. the target function is minimised in a 138,000
dimensional space. I don't think there is a proof that any set of
restraints is ever so ideal that all false solutions are lifted above
the target value of the accepted solution. In fact, without being able
to proove it, I doubt that this the case, which lead me to the below
claim that we don,t necessarily want to reach the global minimum of the
target function.
Of course an acceptable structure actually may have a target value
representing a global minimum, but I don't think this is always true.
Best,
Tim
On 11/28/2014 05:42 PM, Pavel Afonine wrote:
> Hi Tim,
>
> you don't necessarily want to find the global minimum (...)
>
>
> this contradicts the definition of crystallographic structure refinement.
> If finding the global minimum is not what you ultimately want then either
> the refinement target or model parameterization are poor.
>
> Clearly, given complexity of refinement target function profile (in case of
> macromolecules) we unlikely to reach the global minimum; however, reaching
> it is what we aim for (by definition and construction of refinement
> program) .
>
> Pavel
>
--
Dr Tim Gruene
Institut fuer anorganische Chemie
Tammannstr. 4
D-37077 Goettingen
GPG Key ID = A46BEE1A
|