New one-day course: Assessing Data Quality and Disclosure Risk in Numeric Data
National Centre for Research Methods (NCRM)
Dear Colleagues,
The UK Data Service is very pleased to announce our first practical training session for NCRM using our exciting new prototype tool, QAMyData, an open source for quality assessing numeric data.
In this hands-on lab-based course, you will learn about the principles of, and tools for, assessing data quality and reviewing disclosure risk in numeric data sources. Data assessment is extremely useful whether it is for wishing to create high quality data for publishing, thereby supporting the transparency and replication agenda (e.g. to meet funder or journal policy), or simply to check unknown data that has been accessed for reuse. The requirements of the GDPR when processing and de-identifying data benefit from quick examination, using tools where possible.
We introduce the key elements of data quality and disclosure risk, including: file checks, data and metadata checks, and direct and indirect identifiers, and introduce two tools to undertake review:
* QAMyData automatically assesses and reports on elements of quality, such as missingness, labelling, duplication, formats, outliers, and direct identifiers, providing a 'health check' for your data. A user can specify and set thresholds in the QAMYData tool, to indicate what one is prepared to accept.
* sdcMicro, a practical R package for checking disclosure risk through examining combinations of key variables.
Practical demonstrations and hands-on exercises will be used throughout the day and we will finish with a session on how to download the software yourself so that you can use them after the workshop, or integrate them into routine data cleaning and processing pipelines when creating, using, reviewing or publishing data.
To sign up for the course please visit the event page on the NCRM website: https://www.ncrm.ac.uk/training/show.php?article=9311
Many thanks,
Louise and colleagues
__________________________
Louise Corti
Director, Collections Development and Data Publishing
__________________________
T +44(0) 1206 872145
E [log in to unmask]<mailto:[log in to unmask]>
W www.data-archive.ac.uk<http://www.data-archive.ac.uk/>
__________________________
UK Data Service
UK Data Archive
University of Essex
Wivenhoe Park
Colchester
Essex CO4 3SQ
Corti, L., Van den Eynden, V., Bishop, L and Woollard, M. Managing and Sharing Research Data: A Guide to Good Practice. Sage Publications Ltd. http://www.uk.sagepub.com/books/9781446267264
Legal Disclaimer: Any views expressed by the sender of this message are not necessarily those of the UK Data Service or the UK Data Archive. This email and any files with it are confidential and intended solely for the use of the individual(s) or entity to whom they are addressed.
You may leave the list at any time by sending the command
SIGNOFF allstat
to [log in to unmask], leaving the subject line blank.
|