JISCMail - CCP4BB Archives

Jenny,

I of course would suggest that you follow Olve's advice, and use  
theseus to do a maximum likelihood, simultaneous superposition of all  
your structures ( http://www.theseus3d.org ).  The variable bits,  
like your loop, will be naturally down-weighted in a rigorous  
statistical manner.  Then you can look at the average structure file  
that is output (_ave.pdb at the end of the filename), and the B- 
factor column has the overall RMSD for each atom in there.  You can  
look at the full superposition (the _sup.pdb file) in rasmol or in  
pymol with the 'set all_states, on' command.

However, if you really need to do the very analysis that you asked  
about, the following bash script will do exactly that with theseus  
(you need both awk and theseus in your executable path).  It prints  
out the average RMSD for the atoms you specify in the loop, after  
pairwise least-squares superpositioning on all atoms other than the  
loop, for all possible pairwise combinations of your pdb files. (Note  
that in this script all backslashes '\' must have a carriage return  
immediately after them.)  You will need to change the lower and upper  
values at the top of the script (inclusive for the loop you want  
excluded).  You invoke the script something like "karen.sh pdb1.pdb  
pdb2.pdb pdb3.pdb" or "karen.sh *.pdb" to do all the .pdbs in one  
directory.  If you have any problems or have other specific  
superpositioning issues I'm glad to help out.

Cheers,

Douglas


karen.sh
#################################

#!/bin/bash

# everything including and between lower and upper
# is excluded from the superposition
lower=40;
upper=60;

pdbs=($*);

for (( i = 0; i < ${#pdbs[@]}; ++i ))
do
   for (( j = 0; j < i; ++j ))
   do
     name="${pdbs[i]%.*}_${pdbs[j]%.*}";
     theseus -l -r ${name} -S ${lower}-${upper} ${pdbs[i]} ${pdbs[j]}\
             > ${name}.log;
     rmsd=$(cut -c 7-11,61-67 ${name}_ave.pdb |\
     awk '{if ($1 > lo && $1 < up) {sum += $2; n++}}; END {print sum/ 
n}'\
     lo=${lower} up=${upper});
     echo "${name} rmsd = ${rmsd}";
   done
done


####################################


^`^`^`^`^`^`^`^`^`^`^`^`^`^`^`^`^`^`^`^`
Douglas L. Theobald
Department of Biochemistry
Brandeis University
Waltham, MA  02454-9110

[log in to unmask]

              ^\
    /`  /^.  / /\
   / / /`/  / . /`
  / /  '   '
'


On May 11, 2007, at 1:58 PM, Olve Peersen wrote:

> I would highly recommend Doug Theobald's program Theseus for this -  
> the pictures at www.theseus3d.org say it all.  Theseus does maximum  
> likely hood superimpositions of multiple structures (i.e. NOT  
> pairwise against a "master" copy), and the real beauty of it is  
> that you don't have to pick which residues you want to  
> superimpose.  Places where the whole set of structures show  
> divergence are effectively down-weighted and don't contribute much  
> to the final solution vs. least squares where every atom position  
> has equal weight and the "bad" parts screw up the alignment of the  
> "good" parts.  For this, I would do a Theseus superposition of all  
> the structures and then analyze the set of superimposed structures  
> by whatever method you want (e.g. rmsd of variances in important  
> sections of the structures).
>
> - Olve
>
> -------------------------------------------------------
> Olve Peersen
> Associate Professor
> Dept. of Biochemistry & Molecular Biology
> 1870 Campus Delivery
> Colorado State University
> Ft. Collins, CO  80523-1870
> -------------------------------------------------------
> 970.491-0433    Office  (MRB 279)
> 970.491-0271    Lab     (MRB 149)
> 970.491-0494    Fax
> [log in to unmask]
> -------------------------------------------------------
>
> On May 11, 2007, at 11:15 AM, Donnie Berkholz wrote:
>
>> Eleanor Dodson wrote:
>>> It is a bit clunky - you can use siperpose molecules - fit  
>>> residues to
>>> fit a selected range (1-40; 60-100 say) and write out a complete  
>>> fitted
>>> pdb file. Then you could use a VERY old program
>>> compar  xyzin1 original.pdb xyzin2 fitted.pdb  (xyzin3 another.pdb)
>>> and it will match all pairs with the same RESIDUE ID and give the  
>>> RMSD
>>> distance
>>>
>>> There is documentation for it.
>>
>> There's a nice (non-CCP4) program called ProFit that does a pretty  
>> nice
>> job of superimposing with a lot of flexibility.
>>
>> Thanks,
>> Donnie

>> On May 10, 2007, at 6:45 PM, Jenny wrote:
>>> Hi, All,
>>>
>>> I have a question about rmsd calculation.
>>>
>>> I have some pdbs (100 residues ) and these pdbs differ pretty  
>>> much only the loop region 40-60. Is there any easy way that I can  
>>> superimpose the fixed region ( 1-40,60-100) and then calculate  
>>> the rmsd for the loop?I need to calculate for each pair, so if  
>>> there is any script or program available to do this quickly, that  
>>> would be great.
>>>
>>> Thanks.
>>>
>>> Jenny