Bayesian Networks-based personal data synthesis

2020 
Often, confidentiality problems and a lack of original data, make it challenging to analyze user data carefully. In such situations, synthetic data can be used that is more suitable for testing and training marketing strategies, personalized assistants, or behavior analysis systems than the original data. In this paper, the approach for generating synthetic social media profiles data based on Bayesian networks was analyzed. The personal data synthesis problem was considered as the inference of a joint probability distribution from the oriented probabilistic models like Bayesian networks. The quality of this approach in generating VKontakte (VK is the Russian analog of Facebook) social network data was demonstrated and assessed. The Bayesian network approach has shown itself well in the tasks of deriving joint and marginal data distributions, which has led to the production of high-quality synthetic personal data.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    19
    References
    0
    Citations
    NaN
    KQI
    []