Encoding of Numerical Data for Privacy-Preserving Record Linkage.

2020 
BACKGROUND Privacy-preserving record linkage (PPRL) is the process of detecting dataset entries that refer to the same individual within two independent datasets, without disclosing any personal information. While applied in different fields, it particularly attained importance in the medical sector. One popular PPRL method are Bloom filters. However, Bloom filters were originally used for encoding strings only. OBJECTIVES This paper evaluates an encoding method specifically designed for numerical data and adjusts it for encoding geocoordinates in Bloom filters. METHODS The proposed numerical encoding of geocoordinates is compared to the string-based method by using synthetic data. RESULTS The proposed method for encoding geocoordinates in Bloom filters attains a higher recall and precision than the conventional string encoding. CONCLUSION Numerical encoding has the potential of increasing the record linkage quality of Bloom filters, as well as their privacy level.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    0
    References
    1
    Citations
    NaN
    KQI
    []