I hope that some [X]Emacs expert can rewrite Charlie Bond's wonderful pdb-mode to work with mmCIF files (or at least the coordinate bits)
… for exactly the reasons Phil Jeffrey points out
Phil
On 8 Aug 2013, at 00:54, "Jeffrey, Philip D." <[log in to unmask]> wrote:
> Nat Echols wrote:
> > Personally, if I need to change a chain ID, I can use Coot or pdbset or many other tools. Writing code for
> > this should only be necessary if you're processing large numbers of models, or have a spectacularly
> > misformatted PDB file.
>
> Problem. Coot is bad at the chain label aspect.
> Create a pdb file containing residues A1-A20 and X101-X120 - non-overlapping numbering.
> Try to change the chain label of X to A.
> I get "WARNING:: CONFLICT: chain id already exists in this molecule"
>
> This is (IMHO) a bizarre feature because this is exactly the sort of thing you do when building structures.
>
> Therefore I do one of two things:
> 1. Open it in (x)emacs, replace " X " with " A " and Bob's your uncle.
> 2. Start Peek2 - that's my interactive program for doing simple and stupid things like this. I type "read test.pdb" and "chain" and Peek2 prompts me at perceived chain breaks (change in chain label, CA-CA breaks, ATOM/HETATM transitions &c) and then "write test.pdb". Takes less than 10 seconds. CCP4i would probably still be launching, as would Phenix.
>
> The reason I do #1 or #2 is not to be a Luddite, but to do something trivial and boring quickly so I can get back to something interesting like building structures, or beating subjects to death on CCP4bb.
>
> What's lacking is an interactive, or just plain fast method in any guise, way of doing simple PDB manipulations that we do tons of times when building protein structures. I've used Peek2 thousands of times for this purpose, which is the only reason it still exists because it's a fairly stupid program. A truly interactive version of PDBSET would be splendid. But, again, it always runs in batch mode.
>
> mmCIF looked promising, apropos emacs, when I looked at the spec page at:
> http://www.iucr.org/__data/iucr/cifdic_html/2/cif_mm.dic/Catom_site.html
> because that ATOM data is column-formatted. Cool. However looking at 6LYZ.cif from RCSB's site revealed that the XYZ's were LEFT-justified: http://www.rcsb.org/pdb/files/6LYZ.cif
> which makes me recoil in horror and resolve to use PDB format until someone puts a gun to my head.
>
> Really, guys, if you can put multiple successive spaces to the RIGHT of the number, why didn't you put them to the LEFT of it instead ? Same parsing, better readability.
>
> Phil Jeffrey
> Princeton
> (using the vernacular but deathly serious about protein structure)
|