One suggestion, based on personal scars, is make sure that "names" in the test data set include a wide range of printable and international characters: not just letters. Surprising/depressing how many systems fall over with just an apostrophe in a name and there are parts of the world that convert their character sets to ASCII in much stranger ways than that. I'd be tempted to throw in anything that appears on your keyboard, at least.
Andrew
--
Andrew Cormack
Chief Regulatory Adviser, Janet
t: +44 1235 822302
b: https://community.ja.net/blogs/regulatory-developments
Janet(UK) is a trading name of Jisc Collections and Janet Limited, a not-for-profit company which is
registered in England under No.2881024 and whose Registered Office is at Lumen House, Library
Avenue, Harwell Oxford, Didcot, Oxfordshire, OX11 0SG. VAT No. 614944238
> -----Original Message-----
> From: This list is for those interested in Data Protection issues
> [mailto:[log in to unmask]] On Behalf Of Mike Humphrey
> Sent: 29 September 2014 10:20
> To: [log in to unmask]
> Subject: Re: Anonymising real data
>
> Ian
>
> I suppose I had not really thought it through.
>
> When was talking to the developers I was more concerned that they
> didn't look to use a copy of the live data. My thoughts followed the
> path- take live data, scramble it, produce test data which is
> realistic.
>
> I think a danger of generating test data from scratch is that it
> contains what one expects / plans rather than what has already been
> input ... but then again the test data would include 'tricky' items
> that may not have occurred in the real world yet.
>
> All advice appreciated.
>
> Mike
>
> Mike Humphrey : Information Management Officer
> England and Wales Cricket Board : Lord's Cricket Ground, London, NW8
> 8QZ, England.
>
> Tel: +44(0)20 7432 1274 : Mobile: +44 (0) 7837 365507 : Switchboard:
> +44 (0)20 7432 1200
> Email: [log in to unmask] : Web: http://www.ecb.co.uk
>
>
> ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
> All archives of messages are stored permanently and are
> available to the world wide web community at large at
> http://www.jiscmail.ac.uk/lists/data-protection.html
> If you wish to leave this list please send the command
> leave data-protection to [log in to unmask]
> All user commands can be found at
> http://www.jiscmail.ac.uk/help/commandref.htm
> Any queries about sending or receiving messages please send to the
> list owner
> [log in to unmask]
> Full help Desk - please email [log in to unmask] describing your
> needs
> To receive these emails in HTML format send the command:
> SET data-protection HTML to [log in to unmask]
> (all commands go to [log in to unmask] not the list please)
> ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
All archives of messages are stored permanently and are
available to the world wide web community at large at
http://www.jiscmail.ac.uk/lists/data-protection.html
If you wish to leave this list please send the command
leave data-protection to [log in to unmask]
All user commands can be found at http://www.jiscmail.ac.uk/help/commandref.htm
Any queries about sending or receiving messages please send to the list owner
[log in to unmask]
Full help Desk - please email [log in to unmask] describing your needs
To receive these emails in HTML format send the command:
SET data-protection HTML to [log in to unmask]
(all commands go to [log in to unmask] not the list please)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
|