An interesting one this, Nick.
The assumption is being made so far that two entries are likely to relate to
the same site if the grid references are (approximately) the same. A useful
additional approach might be to check that they relate to the same (or
similar) period or type of site as well. This assumes that your original
gazetteer is a consistent list of e.g. lithic scatters or Roman villas. Not
so much use if your gazetteer is just of 'sites in North Xshire'.
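As a rough sketch of that combined check, here is how it might look in Python. The field names ("ngr", "type") and the 100 m tolerance are my own assumptions for illustration, not anything taken from a real SMR schema, and the type comparison is a plain case-insensitive match where in practice you would want a thesaurus lookup.

```python
import math

def ngr_to_en(ngr):
    """Convert an OS National Grid Reference such as 'SE 605 521' to the
    numeric (easting, northing) in metres of the square's south-west corner."""
    s = ngr.replace(" ", "").upper()
    letters, digits = s[:2], s[2:]
    if len(letters) != 2 or not letters.isalpha() or "I" in letters:
        raise ValueError("bad grid letters in %r" % ngr)
    if len(digits) % 2 or (digits and not digits.isdigit()):
        raise ValueError("bad digits in %r" % ngr)
    l1 = ord(letters[0]) - ord("A")
    l2 = ord(letters[1]) - ord("A")
    # The grid lettering skips the letter I
    if l1 > 7:
        l1 -= 1
    if l2 > 7:
        l2 -= 1
    # Position of the 100 km square, counted in units of 100 km
    e100km = ((l1 - 2) % 5) * 5 + (l2 % 5)
    n100km = (19 - (l1 // 5) * 5) - (l2 // 5)
    half = len(digits) // 2
    scale = 10 ** (5 - half)  # pad e.g. a 6-figure reference out to metres
    e = int(digits[:half] or 0) * scale
    n = int(digits[half:] or 0) * scale
    return e100km * 100000 + e, n100km * 100000 + n

def likely_same_site(a, b, tolerance_m=100):
    """True if two records are close together AND share a site type.
    tolerance_m is an assumed value; tune it to the precision of your NGRs."""
    ea, na = ngr_to_en(a["ngr"])
    eb, nb = ngr_to_en(b["ngr"])
    close = math.hypot(ea - eb, na - nb) <= tolerance_m
    same_type = a["type"].strip().lower() == b["type"].strip().lower()
    return close and same_type
```

With this, a gazetteer entry at SE 605 521 recorded as a Roman villa would pair with an SMR record at SE 6052 5214 of the same type, but not with a lithic scatter at the same spot.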
In effect there are two tasks here: getting the gazetteer data into a
digital form (which can then be worked on by computer), then the actual work
to concord that data with your own SMR. The second task is a common one in
all sorts of data exchange scenarios, e.g. it would be the same if you were
dealing with imported digital Defence of Britain project data, PAS data,
LBS data etc.
What we could do with as a community is a general purpose tool for
concording records from different sources based on their similarities
(geospatial, subject matter, reference numbers etc.). Perhaps there are
parallels with the techniques used to analyse the similarities between
artefacts in a collection? It would be a useful adjunct to the various data
exchange initiatives that are our current concerns.
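To give a flavour of what such a tool might do, here is a minimal Python sketch that scores each incoming record against existing ones on the three kinds of similarity mentioned above. Everything specific in it is an assumption on my part: the record fields (id, easting, northing, type, refs), the weights, the 500 m cut-off and the 0.6 threshold are illustrative only, and plain string similarity stands in for a proper thesaurus-based comparison of subject matter.

```python
import math
from difflib import SequenceMatcher

def similarity(a, b, max_dist_m=500.0, weights=(0.5, 0.3, 0.2)):
    """Score two records between 0 and 1 (weights are assumed, not calibrated)."""
    dist = math.hypot(a["easting"] - b["easting"], a["northing"] - b["northing"])
    # Geospatial: 1.0 at the same point, falling to 0.0 at max_dist_m and beyond
    spatial = max(0.0, 1.0 - dist / max_dist_m)
    # Subject matter: crude string similarity of the site type terms
    subject = SequenceMatcher(None, a["type"].lower(), b["type"].lower()).ratio()
    # Reference numbers: any shared cross-reference counts as a full match
    refs = 1.0 if set(a["refs"]) & set(b["refs"]) else 0.0
    w_spatial, w_subject, w_refs = weights
    return w_spatial * spatial + w_subject * subject + w_refs * refs

def best_matches(incoming, existing, threshold=0.6):
    """For each incoming record return (incoming id, best existing id or None, score)."""
    results = []
    for rec in incoming:
        scored = [(similarity(rec, cand), cand["id"]) for cand in existing]
        score, cand_id = max(scored, default=(0.0, None))
        matched = cand_id if score >= threshold else None
        results.append((rec["id"], matched, round(score, 2)))
    return results
```

Records that come back with None are the candidates for "not yet in the SMR"; the ones with a match are where a manual spot check would be aimed.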
Possibly a task for fish.technical
English Heritage DSU
> -----Original Message-----
> From: Tyler Bell
> Sent: 03 September 2003 08:32
> Subject: Re: Paper to digital
> Yes, I'd expand that suggestion a bit more and suggest you run a buffer
> search on each NGR to accommodate any variance between unmatched NGRs of
> what might be the same site in your system. The trick is of course getting
> the NGRs in digital format in the first place: probably best done manually
> without messing around with scanning and OCR.
> Dr Tyler Bell
> Technical Director
> Oxford ArchDigital Ltd.
> On Tue, 2 Sep 2003 11:32:19 +0100, Nick Boldrini wrote:
> > Hello folks
> > any ideas on a good way to approach the following problem would be
> > appreciated.
> > Over the eccentric development of our SMR I have been left with a number
> > of paper gazetteers of sites, without being able to easily tell if these
> > have/have not been digitised.
> > I have a pile of them lurking beside me and need to think of a way of
> > dealing with them.
> > The obvious way is to pick a random sample of them, bash in the NGR and
> > see if there is anything similar to the Gazetteer entry on the digital
> > SMR. This is not a task I relish having better things to do with my time
> > (such as watching paint dry....), and would take a long time.
> > So I am wondering if anyone can think of a better/quicker way to do
> > this??
> > I have thought of scanning the Gazetteers in, converting the NGRs into a
> > MapInfo table and running a find unmatched type query to see if any
> > aren't overlapped by a monument. For those that are, I would pick a
> > random sample to check that it is not another monument that by chance
> > overlays them, but the one I would expect from the Gazetteer.
> > Hopefully by doing this we'll be able to get an idea of whether we need
> > to go through one by one to check each record, or can be confident that
> > the data has been put in.
> > Does anyone have any experience of trying anything like this (scanning
> > etc) or can think of a better way?? Also, what would be a
> > random sample to use to get a good idea of how many records have been
> > digitised (whichever method I use), or that the overlaying monuments are
> > in fact the same ones?? 2% ?? 5%?? More ?? Less??
> > Any ideas appreciated.
> > thanks in advance
> > best wishes
> > Nick Boldrini
> > SMR Computing Officer
> > Heritage Unit
> > North Yorkshire County Council
> > Direct Dial (01609) 532331
> > http://www.northyorks.gov.uk/environment/heritage.shtm#Archaeology