Disambiguation data : extracting information from anonymized sources

Privacy protection is an important consideration when releasing medical databases to the research community. We show that while recent advances in anonymization algorithms provide increased levels of protection, it is still possible to calculate approximations to the original data set. In some cases...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:	Proceedings. AMIA Symposium. - 1998. - (2001) vom: 11., Seite 144-8
1. Verfasser:	Dreiseitl, S (VerfasserIn)
Weitere Verfasser:	Vinterbo, S, Ohno-Machado, L
Format:	Aufsatz
Sprache:	English
Veröffentlicht:	2001
Zugriff auf das übergeordnete Werk:	Proceedings. AMIA Symposium
Schlagworte:	Journal Article

Beschreibung
Zusammenfassung:	Privacy protection is an important consideration when releasing medical databases to the research community. We show that while recent advances in anonymization algorithms provide increased levels of protection, it is still possible to calculate approximations to the original data set. In some cases, one can even uniquely reconstruct entries in a table before anonymization. In this paper, we demonstrate how knowledge of an anonymization algorithm based on ambiguating data cell entries can be used to undo the anonymization process. We investigate the effect of this algorithm and its reversal on data sets of varying sizes and distributions. It is shown that by using a computationally complex disambiguation process, information on individuals can be extracted from an anonymized data set
Beschreibung:	Date Completed 24.05.2002 Date Revised 13.11.2018 published: Print Citation Status MEDLINE
ISSN:	1531-605X