RMIT University
Browse

Embedding differential privacy in decision tree algorithm with different depths

journal contribution
posted on 2024-11-02, 04:48 authored by Xuanyu Bai, Jianguo Yao, Mingxuan Yuan, Ke DengKe Deng, Xike Xie, Haibing Guan
Differential privacy (DP) has become one of the most important solutions for privacy protection in recent years. Previous studies have shown that prediction accuracy usually increases as more data mining (DM) logic is considered in the DP implementation. However, although one-step DM computation for decision tree (DT) model has been investigated, existing research has not studied the scenarios when the DP is embedded in two-step DM computation, three-step DM computation until the whole model DM computation. It is very challenging to embed DP in more than two steps of DM computation since the solution space exponentially increases with the increase of computational complexity. In this work, we propose algorithms by making use of Markov Chain Monte Carlo (MCMC) method, which can efficiently search a computationally infeasible space to embed DP into DT generation algorithm. We compare the performance when embedding DP in DT with different depths, i.e., one-step DM computation (previous work), two-step, three-step and the whole model. We find that the deep combination of DP and DT does help to increase the prediction accuracy. However, when the privacy budget is very large (e.g., ϵ = 10), this may overwhelm the complexity of DT model, and the increasing trend is not obvious. We also find that the prediction accuracy decreases with the increase of model complexity.

History

Journal

Science China Information Sciences

Volume

60

Number

082104

Start page

1

End page

15

Total pages

15

Publisher

Zhongguo Kexue Zazhishe

Place published

China

Language

English

Copyright

© Science China Press and Springer-Verlag 2017

Former Identifier

2006077672

Esploro creation date

2020-06-22

Fedora creation date

2017-10-10

Usage metrics

    Scholarly Works

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC