Dear Martyn, Thank you for your additional comments. One of the future remediation projects will be to address carbohydrate-containing entries. Best wishes, Rachel Green ------------------------------------------------------------------------ Rachel Kramer Green, Ph.D. RCSB PDB [log in to unmask] Twitter: https://twitter.com/#!/buildmodels Facebook: http://www.facebook.com/RCSBPDB On 11/17/2013 3:39 PM, MARTYN SYMMONS wrote: > Hmm > So the backstory for the problematic ligand R12 in this thread is that > a sharp-eyed worker at the wwPDB recently spotted that there was an > error compared with the original 1999 paper. Correcting the R12 ligand > is a friendly gesture from the PDB as it appears that the error must > have been the authors' - the atom correcting the R12 ligand has been > inserted by the PDB staff rather than retreived from a deposited > structure. > It is a shame the same helpful approach is not always applied. One > current example I spotted is a separate 'problematic ligand' 5AX which > has been added by the wwPDB to at least four other authors' entries, > starting in 2006 with the latest in 2009. > 5AX is basically a fragment ligand which the PDB software produces if > a NAG has wandered too far from its Asn sidechain during refinement. > If 5AX is generated during the PDB processing of a deposition, then it > really should be highlighted for the authors as a geometric issue - > rather than, as in these cases, being simply added to the coordinates. > Reading the authors' papers for the 5AX-containing entries makes it > clear that they never expected anything other than NAG to appear in > their deposited coordinates. > And given its artifactual production during deposition, 5AX should > never have 'escaped into the wild'. > So if a retrospective fix can be applied to R12 (which similar > in lacking an atom) then it seems to me that, in fairness, a clean up > of the 5AX entries should be arranged. > Yours (not holding his breath), > Martyn > > *From:* Rachel Kramer Green <[log in to unmask]> > *To:* [log in to unmask] > *Sent:* Wednesday, 6 November 2013, 16:49 > *Subject:* Re: [ccp4bb] Problematic PDBs > > Dear Martyn, > > wwPDB staff regularly reviews and remediates PDB data and related > dictionaries such as the Chemical Component Dictionary (CCD). > > As part of our on-going remediation efforts, the chemical components > in the archive are regularly reviewed to ensure the correctness and > the completeness of the chemical representation. Such reviews show > that in some cases, the author has failed to provide a complete > description of the chemistry. To address any such errors, the > definitions are corrected. The chemical name and formula are changed > in the PDB file, but the coordinates are not changed. > > In the case of entry 3CBS, issues were found with the chemical > component definition for its ligand R12. The methyl group was not in > the deposited coordinates and it was missing from the original > definition. In addition, the bond order in one of the carbon-carbon > bonds was incorrectly defined. The CCD definition for R12 was updated > in 2011 to add the methyl group and to correct the bond order based on > information in the primary citation. The coordinates for this PDB > entry were not changed. Therefore, in accordance with wwPDB policy, > the file was not obsoleted. > > Sincerely, > Rachel Green > > Rachel Kramer Green, Ph.D. > RCSB PDB > [log in to unmask] <mailto:[log in to unmask]> > Twitter: https://twitter.com/#!/buildmodels > <https://twitter.com/#%21/buildmodels> > Facebook: http://www.facebook.com/RCSBPDB > On 10/21/2013 6:28 AM, MARTYN SYMMONS wrote: >> As a postscript it might be worth mentioning one problematic ligand >> that suggested to me a way to correct some of the errors mentioned in >> this thread >> R12 is indicated as 9-(4-HYDROXY-2,6-DIMETHYL-PHENYL)-3.... in the >> most recent Coot monomer library. But in the PDB ligand description >> it is >> 9-(4-hydroxy-2,3,6-trimethylphenyl)-3,7-dimethylnona-2,4,6,8-tetraenoic >> acid with an additional carbon C16. To make a long story short this >> ligand was originally deposited missing this extra methyl goup in >> 1999 (as part of 3CBS) and then apparently updated in 2011 by the PDB. >> >> (the relevant lines in the cif are >> <<snip>> >> R12 C16 C16 C 0 1 N N N ? ? ? -6.631 1.502 0.990 C16 R12 44 >> R12 H1 H1 H 0 1 N N N ? ? ? -6.602 1.511 2.080 H1 R12 45 >> R12 H23 H23 H 0 1 N N N ? ? ? -6.422 2.503 0.613 H23 R12 46 >> R12 H24 H24 H 0 1 N N N ? ? ? -7.619 1.186 0.656 H24 R12 47 >> <<snip>> >> >> with the ? ? ? indicating that refined coordinates were not available >> at the time of the update. There was initially an explanation line at >> the end of the cif: >> >> <<snip>> >> R12 "Other modification" 2011-10-25 RCSB CS 'add missing methyl >> group, re-define bond order based on publication' >> <<snip>> >> >> But this has mutated for some reason (premature stop codon?) over the >> past year to the following. >> >> <<snip>> >> R12 "Other modification" 2011-10-25 RCSB >> <<snip>> >> >> Obviously the full correct ligand could not have been incorporated >> into the PDB entry coordinates without these undergoing a full >> obsolete - supersede process (somewhat embarrassing perhaps as one >> author is now a wwPDB PI ;) >> >> But it is frustrating for users of the PDB that in such cases easily >> correctable errors are not actually updated by the authors. Would it >> not be helpful if there were a mechanism to make and track useful >> improvements in deposited structures? - Perhaps suggested by members >> of the community to the authors. >> >> These changes could be considered as 'corrigenda' and could be >> documented and tracked - complete with an explanation of the >> reasoning behind the change and attributing the motivation and origin >> of the improvement. >> >> This would be a good way for the wider scientific community (who >> maybe do not read this bulletin board) to access the best current >> model without the authors suffering the full process of retracting >> and redepositing their PDB entry. The test for obsoleting would then >> be the same as for a paper - that the change invalidates a >> fundamental interpretation of the data. >> >> All the best >> Martyn >> >> *From:* Pavel Afonine mailto:[log in to unmask] >> *To:* [log in to unmask] <mailto:[log in to unmask]> >> *Sent:* Sunday, 20 October 2013, 19:49 >> *Subject:* Re: [ccp4bb] Problematic PDBs >> >> Hello, >> >> just for the sake of completeness: this paper lists a bunch of known >> pathologies (I would not be surprised if they've been remediated by now): >> >> http://www.phenix-online.org/papers/he5476_reprint.pdf >> >> Pavel >> >> >> On Thu, Oct 17, 2013 at 6:51 AM, Lucas <[log in to unmask] >> <mailto:[log in to unmask]>> wrote: >> >> Dear all, >> >> I've been lecturing in a structural bioinformatics course where >> graduate students (always consisting of people without >> crystallography background to that point) are expected to >> understand the basics on how x-ray structures are obtained, so >> that they know what they are using in their bioinformatics >> projects. Practices include letting them manually build a segment >> from an excellent map and also using Coot to check problems in >> not so good structures. >> >> I wonder if there's a list of problematic structures somewhere >> that I could use for that practice? Apart from a few ones I'm >> aware of because of (bad) publicity, what I usually do is an >> advanced search on PDB for entries with poor resolution and bound >> ligands, then checking then manually, hopefully finding some >> examples of creative map interpretation. But it would be nice to >> have specific examples for each thing that can go wrong in a PDB >> construction. >> >> Best regards, >> Lucas >> >> >> >> > > >