JISCMail - CCPNMR Archives

Email discussion lists for the UK Education and Research communities
Subscriber's Corner
Email Lists
CCPNMR Archives

CCPNMR@JISCMAIL.AC.UK

View:

Message:
[
First
Last
]
By Topic:
[
First
Last
]
By Author:
[
First
Last
]
Font:
Proportional Font
		LISTSERV Archives
		CCPNMR Home
		CCPNMR September 2008
Options

Subscribe or Unsubscribe
Get Password
Subject:
Re: problems loading spectra in v2
From:
Rasmus Fogh <[log in to unmask]>
Reply-To:
CcpNmr software mailing list <[log in to unmask]>
Date:
Thu, 4 Sep 2008 19:00:23 +0100
Content-Type:
MULTIPART/MIXED
Parts/Attachments:
TEXT/PLAIN (47 lines) , Io.py (1 lines)
Dear Mark,

I have found a bug that *may* have somethign to do with the problem. It si
a bit hard to see, but it is certainly a bug, and in the right part of the
code.

Could you try to replace ccp/general/Io.py with the attached file (keep
the old one for reference), and let me know what happens?

Got to go.

Till tomorrow,

Rasmus

---------------------------------------------------------------------------
Dr. Rasmus H. Fogh                  Email: [log in to unmask]
Dept. of Biochemistry, University of Cambridge,
80 Tennis Court Road, Cambridge CB2 1GA, UK.     FAX (01223)766002

On Thu, 4 Sep 2008, Mark Pfuhl wrote:

> Thanks for the comments. I have now done all upgrades that are available but
> I still get the same error message even so the spectrum now appears.
>
> Brians python test revealed the problem. The path that is printed out with
> the error message is indeed not there. And looking at it I should have
> noticed straight away that something was odd:
>
> /mnt/data/mp84/nmr/LZ5K309C_hcn_b800/2/pdata/LZ5K309C_hcn_b800/2/pdata/1/2rr
>
> contains the dataset name LZ5K309C_hcn_b800 twice which is of course
> nonsense. The correct path actually is :
>
> /mnt/data/mp84/nmr/LZ5K309C_hcn_b800/2/pdata/1/2rr
>
> The strange thing is that the path is displayed correctly in the initial
> Open spectrum window. After clicking on Open spectra the Verify Spectrum
> window shows sensible parameters for the spectrum in the Verify Referncing
> tab. More importantly in the Verify File Details tab the path to the 2rr
> file is shown correctly.
>
> It looks as if one part of analysis gets it right and the spectrum is indeed
> picked up, But another part makes a mess of the path and generates the error
> message. Very odd.
>


"""

======================COPYRIGHT/LICENSE START==========================



Io.py: General I/O code for CCPN



Copyright (C) 2008 Wayne Boucher, Rasmus Fogh, Tim Stevens and Wim Vranken (University of Cambridge and EBI/MSD)



=======================================================================



This library is free software; you can redistribute it and/or

modify it under the terms of the GNU Lesser General Public

License as published by the Free Software Foundation; either

version 2.1 of the License, or (at your option) any later version.

 

A copy of this license can be found in ../../../license/LGPL.license

 

This library is distributed in the hope that it will be useful,

but WITHOUT ANY WARRANTY; without even the implied warranty of

MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU

Lesser General Public License for more details.

 

You should have received a copy of the GNU Lesser General Public

License along with this library; if not, write to the Free Software

Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA





======================COPYRIGHT/LICENSE END============================



for further information, please contact :



- CCPN website (http://www.ccpn.ac.uk/)

- MSD website (http://www.ebi.ac.uk/msd/)



=======================================================================



If you are using this software for academic purposes, we suggest

quoting the following references:



===========================REFERENCE START=============================

R. Fogh, J. Ionides, E. Ulrich, W. Boucher, W. Vranken, J.P. Linge, M.

Habeck, W. Rieping, T.N. Bhat, J. Westbrook, K. Henrick, G. Gilliland,

H. Berman, J. Thornton, M. Nilges, J. Markley and E. Laue (2002). The

CCPN project: An interim report on a data model for the NMR community

(Progress report). Nature Struct. Biol. 9, 416-418.



Wim F. Vranken, Wayne Boucher, Tim J. Stevens, Rasmus

H. Fogh, Anne Pajon, Miguel Llinas, Eldon L. Ulrich, John L. Markley, John

Ionides and Ernest D. Laue (2005). The CCPN Data Model for NMR Spectroscopy:

Development of a Software Pipeline. Proteins 59, 687 - 696.



Rasmus H. Fogh, Wayne Boucher, Wim F. Vranken, Anne

Pajon, Tim J. Stevens, T.N. Bhat, John Westbrook, John M.C. Ionides and

Ernest D. Laue (2005). A framework for scientific data modeling and automated

software development. Bioinformatics 21, 1678-1684.



===========================REFERENCE END===============================

"""

#

# Convenient I/O functions

#



import os, urllib, re



from memops.universal import Io as uniIo

from memops.general.Implementation import ApiError

from memops.general.Io import findCcpXmlFile

from memops.api import Implementation

from memops.format.xml import XmlIO



from ccp.general.Constants import chemCompServer, chemCompWebPath

from ccp.general.Constants import standardResidueCcpCodes

from ccp.general.Util import setCurrentStore



def getDataPath(*args):

  

  """

  Gives location of data path. Extra args are added on as extra directories

  Result ends with the last arg (which might be either a file or a dictionary)

  """

  dataPath = uniIo.joinPath(uniIo.getTopDirectory(),'data',*args)

  return dataPath





def getChemComp(project, molType, ccpCode, download=True, showError=None,

                partialLoad=False):

  """ get ChemComp corresponding to molType,ccpCode, 

  looking 1) in memory, 2) in Repositories on lookup path,

  3) in allChemComps directory, 4) downloading from msd ChemComp server.

  For 3) and 4) save new ChemComp in first Repository on PAckageLocator lookup path

  Do 4) only if download==True

  

  showError is an optional GUI error handler that can be passed in.

  partialLoad controls if only the TopObject (default) or the entire file is loaded

  

  Optimised to avoid mass reading.

  """

  

  

  # First get it if already loaded

  chemComp = project.getByNavigation(('chemComps',(molType,ccpCode)))

  

  if chemComp is None:

    # try to load it from an existing repository - avoiding mass loading

    packageName = 'ccp.molecule.ChemComp'  

    chemCompFileSearchString = "%s+%s+*.xml" % (molType,uniIo.getCcpFileString(ccpCode))

    

    chemCompXmlFile = findCcpXmlFile(project, packageName, chemCompFileSearchString)

    if chemCompXmlFile:

      chemComp = XmlIO.loadFromFile(project, chemCompXmlFile, partialLoad=partialLoad)



    if chemComp is None:

      # try to get it from allChemComps directory, if any, or to download it.

      chemCompPath = getDataPath('allChemComps')

      ccLocator = (project.findFirstPackageLocator(targetName=packageName) or

                   project.findFirstPackageLocator(targetName='any'))

      repository = ccLocator.findFirstRepository()

      

      fileFound = getChemCompXmlFile(repository, chemCompPath, molType, ccpCode,

                                    showError=showError)

      if not fileFound and download:

        fileFound = downloadChemCompXmlFile(repository, molType, ccpCode,

                                            showError=showError)

      if fileFound:

        chemComp = XmlIO.loadFromFile(project, fileFound, 

                                      partialLoad=partialLoad)

  #

  return chemComp

  

  

def getChemCompXmlFile(repository, chemCompPath, molType, ccpCode, 

                       showError=None):

  """

  Fetch chemComp 'molType', 'ccpCode' to local repository 'repository'

  from repository defined by chemCompPath

  showError is an error display function, passed in were appropriate for GUI

  contexts

  Returns name of copied file, or None if unsuccessful

  """

  

  if showError is None:

    showError = uniIo.printError

  

  

  result = None

  # Try to find file...

  

  import glob

  

  chemCompFileSearchString = "%s+%s+*.xml" % (molType,uniIo.getCcpFileString(ccpCode))

  # TODO HERE HAVE TO USE repository.getFileLocation name to get path to chemCompFileName?

  chemCompFileSearchPath = os.path.join(chemCompPath,'ccp','molecule','ChemComp')

  

  chemCompFileNameMatches = glob.glob(os.path.join(chemCompFileSearchPath,chemCompFileSearchString))

  

  if chemCompFileNameMatches:

    if len(chemCompFileNameMatches) > 1:

      errorText = "Error: multiple matches found for chemComp %s.%s - taking last one." % (molType,ccpCode)

      showError("Multiple ChemComp matches", errorText)

    

    chemCompFilePath = chemCompFileNameMatches[-1]

    (chemCompFileDir,chemCompFileName) = os.path.split(chemCompFilePath)

  

    #

    # Copy file if found...

    #



    if os.path.exists(chemCompFilePath):

      saveChemCompPath = repository.getFileLocation('ccp.molecule.ChemComp')

      if not os.path.exists(saveChemCompPath):

        os.makedirs(saveChemCompPath)

      import shutil

      saveChemCompFilePath = os.path.join(saveChemCompPath,chemCompFileName)

      shutil.copy(chemCompFilePath,saveChemCompFilePath)

      result = saveChemCompFilePath

    

      print "  ChemComp file %s copied from repository %s..." % (chemCompFileName,chemCompPath)

  

  #

  return result





def downloadChemCompXmlFile(repository, molType, ccpCode, showError=None):

  """

  Fetch chemComp 'molType', 'ccpCode' to local repository 'repository'

  from chemCompServer

  showError is an error display function, passed in were appropriate for GUI

  contexts

  Returns name of copied file, or None if unsuccessful

  """

  

  if showError is None:

    showError = uniIo.printError

    

  result = None

    

  chemCompXmlFilePatt = re.compile("(" + molType + "\+" + 

                                   uniIo.getCcpFileString(ccpCode) 

                                   + "\+[^\s\"\>]+\.xml)")



  try:

    urlLocation = "http://%s%s" % (chemCompServer, os.path.join(chemCompWebPath,

                                                                molType))

    r1 = urllib.urlopen(urlLocation)

    try:

      data = r1.read()

      r1.close()

      

      dataLines = data.split("\n")

      chemCompXmlFile = None

      for dataLine in dataLines:

        chemCompSearch = chemCompXmlFilePatt.search(dataLine)

        if chemCompSearch:

          chemCompXmlFile = chemCompSearch.group(1)

          

      if chemCompXmlFile:

        urlLocation = os.path.join(urlLocation,chemCompXmlFile)

        r2 = urllib.urlopen(urlLocation)

  

        try:

          data = r2.read()

          r2.close()

  

          try:

            saveChemCompPath = repository.getFileLocation('ccp.molecule.ChemComp')

            if not os.path.exists(saveChemCompPath):

              os.makedirs(saveChemCompPath)

  

            chemCompFile = os.path.join(saveChemCompPath,chemCompXmlFile)

            fout = open(chemCompFile,'w')

            fout.write(data)

            fout.close()

  

            print ("Downloaded chemComp %s, %s from server %s, written to file %s!"

                   % (molType,ccpCode,chemCompServer,chemCompFile))

            result = chemCompFile

  

          except IOError, e:

            showError("Cannot write file", 

                      "Cannot write chemComp XML file %s, %s: %s" 

                      % (molType,ccpCode,str(e)))

  

        except IOError, e:

          showError("Cannot read file", "Cannot read chemComp %s, %s: %s" 

                    % (molType,ccpCode,str(e)))

      

        

      else:

        showError("Cannot find file", "Cannot find chemComp XML file %s, %s." 

                  % (molType,ccpCode))

      

    except IOError, e:

      showError("Cannot read directory", 

                "Cannot read directory information for molType %s: %s" 

                % (molType,ccpCode,str(e)))



  except IOError, e:

    showError("No connection", 

              "Cannot connect to download server %s, or file does not exist...: %s " 

              % (chemCompServer,str(e)))

  #

  return result

  



def getChemCompCoord(project, sourceName, molType, ccpCode):

  """ get ChemCompCoord corresponding to sourceName,molType,ccpCode, 

  looking 1) in memory, 2) in Repositories on lookup path

  

  Optimised to avoid mass reading.

  """

  

  

  # First get it if already loaded

  chemCompCoord = project.getByNavigation(('chemCompCoords',(sourceName,molType,ccpCode)))

  

  if chemCompCoord is None:

    # try to load it from an existing repository - avoiding mass loading

    packageName = 'ccp.molecule.ChemCompCoord'  

    chemCompCoordFileSearchString = "%s+%s+%s+*.xml" % (uniIo.getCcpFileString(sourceName),

                                                        molType, uniIo.getCcpFileString(ccpCode))

    

    chemCompCoordXmlFile = findCcpXmlFile(project, packageName, chemCompCoordFileSearchString)

    if chemCompCoordXmlFile:

      chemCompCoord = XmlIO.loadFromFile(project, chemCompCoordXmlFile, partialLoad=False)

  #

  return chemCompCoord





def getStdChemComps(project,molTypes=None):



  chemComps = []



  if not molTypes:

  

    molTypes = ['protein','RNA','DNA']



  for molType in molTypes:

  

    if standardResidueCcpCodes.has_key(molType):

  

      for ccpCode in standardResidueCcpCodes[molType]:

        chemComp = getChemComp(project, molType, ccpCode, download=False)

        if chemComp:

          chemComps.append(chemComp)



  return chemComps





def setDataSourceDataStore(dataSource, dataUrlPath, localPath, 

                           dataLocationStore=None, dataUrl=None):

  

  #

  # Get DataLocationStore

  #

  

  if not dataLocationStore:

    

    setCurrentStore(dataSource.root,'DataLocationStore')

    dataLocationStore = dataSource.root.currentDataLocationStore

  

  #

  # Get (or create) DataUrl

  #

  

  # TODO should this search function go elsewhere?

  if not dataUrl:

    for tmpDataUrl in dataLocationStore.dataUrls:

      if tmpDataUrl.url.dataLocation == dataUrlPath:

        dataUrl = tmpDataUrl

    

    if not dataUrl:

      dataUrlPath = uniIo.normalisePath(dataUrlPath)

      dataUrl = dataLocationStore.newDataUrl(url = Implementation.Url(path = dataUrlPath))

      

  #

  # Create a BlockedBinaryMatrix. TODO: could be other classes that are set up this way - rename func and make general,, pass in class?

  #  

  localPath = uniIo.normalisePath(localPath)

  blockedBinaryMatrix = dataLocationStore.newBlockedBinaryMatrix(path=localPath, 

                                                                 dataUrl=dataUrl)

  

  """

  TODO Set here as well, or do this later after returning object:



blockSizes      Int      0..*     Block sizes in dimension order  

complexStoredBy   ComplexStorage   1..1   The ordering of real and imaginary parts of hypercomplex numbers in the data matrix. See ComplexStorage type for details  

hasBlockPadding   Boolean   1..1   Are data padded to fill all blocks completely? Alternatively incomplete blocks store only the actual data.  

headerSize   Int   1..1   Header size in bytes  

isBigEndian   Boolean   1..1   Are data big-endian (alternative little-endian).  

isComplex   Boolean   0..*   Are numbers complex (if True) or real/integer (if False).  

nByte   Int   1..1   Number of bytes per number  

numPoints   Int   0..*   number of points for each matrix dimension - also defines dimensionality of matrix. The number of points is the same for real or complex data, in the sense that n complex points require 2n real numbers for storage.  

numRecords   Int   1..1   Number of matrix records in file. All other information in the object describes a single record.  

numberType   NumberType   1..1   Type of numbers held in matrix  

  

  """

  

  dataSource.dataStore = blockedBinaryMatrix

  

  return blockedBinaryMatrix





def getDataSourceFileName(dataSource):



  dataStore = dataSource.dataStore



  if not dataStore:

    return None



  return dataStore.fullPath





def setDataSourceFileName(dataSource, fileName):



  dataStore = dataSource.dataStore



  if dataStore is None:

    raise ApiError('dataStore is None')



  preferDataUrls=(dataStore.dataUrl,)

  (dataUrl, filePath) = getDataStoringFromFilepath(dataSource.root,

                               fullFilePath=fileName,

                               preferDataUrls=preferDataUrls,

                               dataLocationStore=dataStore.dataLocationStore)



  dataStore.dataUrl = dataUrl

  dataStore.path = filePath





def getDataStoringFromFilepath(memopsRoot, fullFilePath, preferDataUrls=None,

                               dataLocationStore=None, keepDirectories=1):

  

  # make absolute,, normalised path

  fullFilePath = uniIo.normalisePath(fullFilePath, makeAbsolute=True)

  

  dataUrl, filePath = findDataStoringFromFilepath(memopsRoot, fullFilePath, 

                                                  preferDataUrls,

                                                  dataLocationStore,

                                                  keepDirectories)

  

  if dataUrl is None:

  

    urlPath = uniIo.normalisePath((fullFilePath[:-len(filePath)]))

    dataLocationStore = memopsRoot.currentDataLocationStore

    dataUrl = dataLocationStore.newDataUrl(

                                   url=Implementation.Url(path=urlPath))

    dataUrl.name = 'auto-%s' % dataUrl.serial

  #

  return (dataUrl, filePath)



def findDataStoringFromFilepath(project, fullFilePath, preferDataUrls=None,

                               dataLocationStore=None, keepDirectories=1):

  """ Get DataUrl and relative filePath from normalised absolute filePath

  Uses heuristics to select compatible DataUrl from existing ones.

  sisterObjects is a collection of objects with a dataStore link - 

  DataUrls in use for sisterObjects are given preference in the

  heuristics.

  uses dataLocationStore or current dataLocationStore

  If no compatible DataUrl is found the routine returns dataUrl None

  and the file name plus the lowest keepDirectories directories 

  as the filePath

  """

  # NB fullFilePath *must* be absolute her for code to work properly

  # 

  if not os.path.isabs(fullFilePath):

    raise ApiError(

     "findDataStoringFromFilepath called with non-absolute file name %s"

     % fullFilePath)

  

  # get DataLocationStore

  if not dataLocationStore:

    setCurrentStore(project,'DataLocationStore')

    dataLocationStore = project.currentDataLocationStore

  

  # get DataUrl that match fullFilePath

  dataUrls = []

  for dataUrl in dataLocationStore.dataUrls:

    dirPath = uniIo.normalisePath(dataUrl.url.path)

    if fullFilePath.startswith(dirPath):

      lenPath = len(dirPath)

      ss = fullFilePath

      while len(ss) > lenPath:

        ss,junk = uniIo.splitPath(ss)

      if ss == dirPath:

        # DataUrl path matches file path

        dataUrls.append(dataUrl)

  

  # process result

  if dataUrls:

    if preferDataUrls:

      # look for DataUrls that are in use with related objects

      ll = [x for x in dataUrls if x in preferDataUrls]

      if ll:

        dataUrls = ll

        

    if len(dataUrls) == 1:

      # only one DataUrl - use it

      dataUrl = dataUrls[0]

    else:

      # use DataUrl with longest path

      ll = [(len(dataUrl.url.path),dataUrl) for dataUrl in dataUrls]

      ll.sort()

      dataUrl = ll[-1]

    

    # get filePath

    ss = uniIo.joinPath(dataUrl.url.path, '') # adds file separator to end

    filePath = fullFilePath[len(ss):]

  

  else:

    dataUrl = None

    ll = []

    ss = fullFilePath

    for dummy in range(keepDirectories + 1):

      ss,name = os.path.split(ss)

      ll.append(name)

    ll.reverse()

    filePath = uniIo.joinPath(*ll)

  

  #

  return (dataUrl, filePath)
Top of Message | Previous Page | Permalink
JiscMail Tools

Files Area | help
RSS Feeds and Sharing

Search Archives

Advanced Options