JiscMail Logo
Email discussion lists for the UK Education and Research communities

Help for CCP4BB Archives


CCP4BB Archives

CCP4BB Archives


CCP4BB@JISCMAIL.AC.UK


View:

Message:

[

First

|

Previous

|

Next

|

Last

]

By Topic:

[

First

|

Previous

|

Next

|

Last

]

By Author:

[

First

|

Previous

|

Next

|

Last

]

Font:

Proportional Font

LISTSERV Archives

LISTSERV Archives

CCP4BB Home

CCP4BB Home

CCP4BB  April 2011

CCP4BB April 2011

Options

Subscribe or Unsubscribe

Subscribe or Unsubscribe

Log In

Log In

Get Password

Get Password

Subject:

Quick-and-dirty searches of both PDB and EMDB at PDBe

From:

Gerard DVD Kleywegt <[log in to unmask]>

Reply-To:

Gerard DVD Kleywegt <[log in to unmask]>

Date:

Fri, 1 Apr 2011 17:10:49 +0200

Content-Type:

TEXT/PLAIN

Parts/Attachments:

Parts/Attachments

TEXT/PLAIN (180 lines)

Hi all,

As part of its recent winter update, the Protein Data Bank in Europe (PDBe; 
http://pdbe.org/) has improved its facility that allows for tandem searches of 
PDB and EMDB. It was designed to allow users to carry out many of their 
day-to-day searches (without the need to fill out a complex form or learn a 
special query syntax). Simply type what you are looking for, click the SEARCH 
button, and we will do our best to dig up relevant information, be it in the 
PDB, in EMDB or on our website.

QUICK ACCESS TO ENTRIES, SERVICES, SEQUENCES
--------------------------------------------

If you go to the PDBe home page (http://pdbe.org/), you will see a Google-like 
search box in the friendly green banner near the top of the page (just below 
our motto, "Bringing Structure to Biology"). You can use this search box in a 
number of ways:

- type a PDB code (e.g., 1cbs), and you will be taken directly to the summary 
page for that entry. You can type any valid code, even if it's not in the 
current release, so you can use this facility to obtain information about the 
status of entries that have not been released yet (e.g., 2yd0) or entries that 
are no longer in the archive (e.g., theoretical models).

- type a valid EMDB code (e.g., 1607) and you will be taken straight to the 
summary page for that entry.

HINT: if, instead of being taken directly to a summary page for a certain PDB 
or EMDB code, you want to actually search PDB and/or EMDB for references to 
that particular code, simply enclose it in double quotes. For instance, 
searching for 1mi6 will take you to the summary page for PDB entry 1mi6, 
whereas searching for "1mi6" will give you a set of hits in both PDB and EMDB 
that all contain a reference to 1mi6.

- type something resembling a PDBe service or resource name and chances are 
that the name will be recognised and you will be taken straight to that 
service or resource (e.g., autodep, emdep, pdbemotif, pdbepisa, pdbefold, 
pdbechem, quips, portfolio, etc.).

- you can search the protein sequences in the PDB by entering seq: (or 
sequence:) followed by a (partial) amino-acid sequence in one-letter code 
(e.g., seq:GNAAAAKKGSEQESVKEFLAKAKEDFLKKWETPSQNTA). The sequence will be 
compared to all protein sequences in the PDB using FastA, and the results will 
be presented to you for further analysis in the PDBe sequence browser (see 
http://pdbe.org/sequence).

TEXT-BASED SEARCHES
-------------------

Of course you can do general text-based searches of the PDB and EMDB as well - 
just type one or more search terms in the box and hit the SEARCH button.

- If you type a single search term and it gives hits in the PDB, you will get 
a results page with a tree structure on the left which shows in which 
categories the term was found. For instance, if you look for Jones, that could 
be an author, but it could also be part of the name of a molecule (e.g., Bence 
Jones protein). By clicking on an appropriate branch in the tree, you select 
only those entries for which the search term occurs in that data category 
(e.g., author or PDB compound).

- If you type more than one search term, only entries that contain all these 
terms will be selected as hits. For instance, if you search for "kleywegt po4" 
- without the quotes - you will get only one hit, 1CBQ. Note that if you 
enclose your search terms in double quotes, you will only get hits that match 
exactly (i.e., the complete search expression must occur somewhere in the 
entry, not just all of the keywords individually). For instance, searching for 
"HCV NS3 protease" yields 31 hits in the PDB if you enclose the terms in 
double quotes, but 177 hits if you don't.

Note that there are two tabs on the results page - one labelled "PDB entries" 
and the other "EMDB entries". If you do a search for Baumeister, you will get 
14 hits in the PDB. If you click on the "EMDB entries" tab, you will find that 
there are 10 hits in EMDB.

HINT: if you want the EMDB results tab to become active straightaway, preface 
your search term(s) by "emdb:" (without the quotes), e.g. search for 
emdb:saibil and you will immediately get the list of 56 EMDB hits.

SEARCH RESULTS
--------------

The search results are sorted by release date by default, with the most 
recently released entries at the top. This ensures that if you read an 
exciting paper about new ClpC structures, a search for clpc will give you the 
latest entries first. You can change the sort order and criterion with a 
drop-down menu.

Each entry that is found as a hit in a search is shown in a panel that 
contains useful summary information and allows you to launch various searches 
and services with a single mouse-click. If you do a search for hiv-1, for 
example, you will get many hits in the PDB and two dozen in EMDB:

- For each PDB hit you will see: the PDB code, a small image of the structure, 
the resolution (for X-ray and EM structures), the title of the entry, a set of 
PDBprints that provide at-a-glance information about the entry (see 
http://pdbe.org/pdbprints). Two action buttons are also shown: "Entry summary" 
and "Download PDB file" - when pressed, they will do what they promise. If you 
click on "More ..." (or on "Expand all ..." at the top of the results tab) you 
will see even more information, namely the release date, information about the 
publication describing the structure, possible cross-references to EMDB 
entries and four more action buttons ("Quick links to related PDBe services"), 
namely:
   * "Download other files" (takes you to a download page with mmCIF files, 
experimental data files, etc.),
   * "Quaternary structure" (which takes you straight to the PISA results about 
probable assemblies),
   * "Similar structures" (which will automatically launch an SSM/PDBeFold 
search of the PDB to look for structures with similar folds),
   * "Motifs and sites" (which will take you to the PDBeMotif analysis of the 
structure - this may not always work for very recent entries, but we are 
working on solving this issue).

- For each EMDB hit you will see similar information as for the PDB hits. 
Instead of a PDB file, there will be an action button to "Download header 
file". If the EM map/tomogram has been released, there will also be a button 
to download it. If you click "More ..." you will sometimes see "Other EMDB 
entries from this publication" (if one paper describes more than one EMDB 
entry).

NOTE: if you search for hiv-1 today, you will note that the top 2 EMDB hits 
have release dates of 29 March 2011, but the maps are not yet available for 
download. This has to do with the way entries are released in practice (EMDB 
and PDB use a weekly release cycle). Once the release date has arrived, an 
EMDB entry will be flagged (for release) on the first Thursday following the 
release date, which means the map will become available in the next weekly 
release (which will be on the first Wednesday after that Thursday).

LIMITATIONS AND SEARCH TIPS
---------------------------

As you have seen, the Google-like box in the PDBe banner allows you to carry 
out many standard searches quickly and accurately, but with some limitations, 
such as:

- you cannot use regular expressions or operators such as NOT, AND and OR
- you cannot use wildcards (e.g., searching for "vanil*" will not give any 
hits)
- if you search by author name, you will get better results if you only 
provide the surname (searching for "kleywegt gj" only returns one hit, and 
it's not a structure determined by that person)
- at present, there is no way of ranking the results by relevance (the search 
includes PDB keywords and the PubMed abstract, both of which can lead to false 
positives)

In general, the more search terms you enter, the fewer results you will get 
(as they are all required to occur). Useful search terms are (combinations of) 
the following:

- surnames of authors (e.g., rossmann, sixma, allerston, akke, walse)
- names of proteins (e.g., HMG CoA reductase, Lon protease, bacteriorhodopsin)
- (parts of) species names (e.g., "plasmodium falciparum")
- common names of chemical compounds (e.g., retinol, sildenafil, nadph)
- database identifiers from EC, PubMed, UniProt, etc. (e.g., entering 20890284 
will retrieve two structures that have that number as their PubMed identifier; 
searching for CH60_ECOLI will retrieve 30 hits, 28 of which have that as a 
UniProt identifier; searching for 1.1.1.27 will return 105 hits, 63 of which 
contain a lactate dehydrogenase with that EC number)
- (part of) a valid GO term (e.g., "intracellular protein transport", "signal 
sequence", "anchored to membrane")

If you want to carry out more sophisticated searches, in which you can specify 
that you want to search for a term in a particular category of information 
(e.g., looking for "parkinson" in an abstract rather than as an author, or 
looking for "cancer" as part of a reference rather than a keyword), you can 
use the PDBe advanced search facilities. An action button labelled "Advanced 
search" is available between the green PDBe banner and the search results.

                                    -----

We welcome your comments, bug reports and feature requests on the 
"quick-and-dirty" PDBe search facility. Please use the feedback button at the 
top of any PDBe web page.

--Gerard

---
Gerard J. Kleywegt, PDBe, EMBL-EBI, Hinxton, UK
[log in to unmask] ..................... pdbe.org
Secretary: Pauline Haslam  [log in to unmask]

Top of Message | Previous Page | Permalink

JiscMail Tools


RSS Feeds and Sharing


Advanced Options


Archives

May 2024
April 2024
March 2024
February 2024
January 2024
December 2023
November 2023
October 2023
September 2023
August 2023
July 2023
June 2023
May 2023
April 2023
March 2023
February 2023
January 2023
December 2022
November 2022
October 2022
September 2022
August 2022
July 2022
June 2022
May 2022
April 2022
March 2022
February 2022
January 2022
December 2021
November 2021
October 2021
September 2021
August 2021
July 2021
June 2021
May 2021
April 2021
March 2021
February 2021
January 2021
December 2020
November 2020
October 2020
September 2020
August 2020
July 2020
June 2020
May 2020
April 2020
March 2020
February 2020
January 2020
December 2019
November 2019
October 2019
September 2019
August 2019
July 2019
June 2019
May 2019
April 2019
March 2019
February 2019
January 2019
December 2018
November 2018
October 2018
September 2018
August 2018
July 2018
June 2018
May 2018
April 2018
March 2018
February 2018
January 2018
December 2017
November 2017
October 2017
September 2017
August 2017
July 2017
June 2017
May 2017
April 2017
March 2017
February 2017
January 2017
December 2016
November 2016
October 2016
September 2016
August 2016
July 2016
June 2016
May 2016
April 2016
March 2016
February 2016
January 2016
December 2015
November 2015
October 2015
September 2015
August 2015
July 2015
June 2015
May 2015
April 2015
March 2015
February 2015
January 2015
December 2014
November 2014
October 2014
September 2014
August 2014
July 2014
June 2014
May 2014
April 2014
March 2014
February 2014
January 2014
December 2013
November 2013
October 2013
September 2013
August 2013
July 2013
June 2013
May 2013
April 2013
March 2013
February 2013
January 2013
December 2012
November 2012
October 2012
September 2012
August 2012
July 2012
June 2012
May 2012
April 2012
March 2012
February 2012
January 2012
December 2011
November 2011
October 2011
September 2011
August 2011
July 2011
June 2011
May 2011
April 2011
March 2011
February 2011
January 2011
December 2010
November 2010
October 2010
September 2010
August 2010
July 2010
June 2010
May 2010
April 2010
March 2010
February 2010
January 2010
December 2009
November 2009
October 2009
September 2009
August 2009
July 2009
June 2009
May 2009
April 2009
March 2009
February 2009
January 2009
December 2008
November 2008
October 2008
September 2008
August 2008
July 2008
June 2008
May 2008
April 2008
March 2008
February 2008
January 2008
December 2007
November 2007
October 2007
September 2007
August 2007
July 2007
June 2007
May 2007
April 2007
March 2007
February 2007
January 2007


JiscMail is a Jisc service.

View our service policies at https://www.jiscmail.ac.uk/policyandsecurity/ and Jisc's privacy policy at https://www.jisc.ac.uk/website/privacy-notice

For help and support help@jisc.ac.uk

Secured by F-Secure Anti-Virus CataList Email List Search Powered by the LISTSERV Email List Manager