A Dynamic Social Network Data Publishing Algorithm Based on Differential Privacy

Social network contains the interaction between social members, which constitutes the structure and attribute of social network. The interactive relationship of social network contains a lot of personal privacy information. The direct release of social network data will cause the disclosure of privacy information. Aiming at the dynamic characteristics of social network data release, a new dynamic social network data publishing method based on differential privacy was proposed. This method was consistent with differential privacy. It is named DDPA (Dynamic Differential Privacy Algorithm). DDPA algorithm is an improvement of privacy protection algorithm in static social network data publishing. DDPA adds noise which follows Laplace to network edge weights. DDPA identifies the edge weight information that changes as the number of iterations increases, adding the privacy protection budget. Through experiments on real data sets, the results show that the DDPA algorithm satisfies the user’s privacy requirement in social network. DDPA reduces the execution time brought by iterations and reduces the information loss rate of graph structure.


Introduction
The innovation of the knowledge society has promoted the advent of the era of "Internet +", such as medical data, big data of intelligent city and large education data, which lead the trend of Internet changes.Social network is a new application mode under the Internet background, and the data dissemination in social tial privacy technology, privacy information retrieval technology, and accountability system.The social network privacy protection method mainly studies the static social network data dissemination.

Related Work
Social network Privacy protection technology mainly includes social network data release privacy protection technology based on clustering, and social network data publishing privacy technology based on the graph modification.Terzi [1] used the degree of the node as the background knowledge of the attacker and proposed the k-degree anonymous attack to prevent the node from being anonymous; Lan Lihui [2] proposed a stochastic perturbation method based on differential privacy model and designed LWSPA algorithm to realize the strong protection of edge and variable weight; Zhang Wei [3] proposed a new algorithm for social network data dissemination which based on the hierarchical random graph and satisfies the difference privacy; Chen Chunling [4] classified the privacy protection level, anonymized the nodes through k-grouping, (k, Δd), and reduced the information loss of the social network graph.
If the static social network data release method is applied directly to the background of the dynamic social network, although it can meet the requirements of privacy protection policy, the time overhead will increase, and the information loss of graph structure will be increased.For the dynamic social network data dissemination method, Ying [5] proposed a random method based on edge spectrum graph to anonymously manipulate the information of nodes and edges, which can prevent attackers from attacking with background knowledge as known condition; Bhagat and Cormode [6] [7] proposed a connection prediction algorithm, which simulates the social network changes caused by new nodes or edges are added to the release graph.The algorithm is based on a predictive chart for group anonymity.Cheng [8] proposed a k-isomorphism privacy protection model.In this method, the iterative distribution of the dynamic social network is processed by the Generalization node identification, which can resist the graph structured attack in the data publishing process; Karwa and Chen [9] [10] [11] applied the differential privacy technology to the privacy protection of social network.However, the difference privacy com-plexity of the method graph is high; Guo Caihua [12] summed up the incremental dynamic social network into an incremental sequence of weighted graphs.
This paper constructed a privacy protection model based on K-Anonymous weighted graph increment sequence.Guo Caihua proposed an anonymous algorithm WLKA and HVKA based on the weight chain list, which can prevent attackers from attacking based on node label and weight list.
The key of privacy protection problem of dynamic social network data is how to protect the user's sensitive information effectively under the acceptable time cost, and to ensure the loss rate of weight information is small.The main contributions of this paper are as follows.
1) This paper makes an improvement on the social network differential privacy data publishing algorithm based on MCL (Markov Clustering algorithm) [13].This paper presents the method of data privacy protection which can be applied to the dynamic social network better; 2) In this paper, a strict differential privacy preserving model is introduced.
This paper designs a DDPA algorithm that satisfies ε-difference privacy.The DDPA algorithm identifies the edge weight information that changes as the number of iterations increases and adds the privacy protection budget that satisfies ε.The algorithm achieves privacy protection by injecting noise from the Laplace distribution into the weight of the nodes where the nodes are clustered; 3) This paper experiments on the real social network data set.Comparing with the direct application of MDPA algorithm, the DDPA algorithm satisfies the user's privacy requirement in the social network, reduces the execution time and the loss rate of weight information.

Basic Concept
, V I represents a collection of users in the social network at the time of iterations, and E I represents a collection of the edges of the interaction between users in the social network at the time of iterations,  represents the collection of social network graphs which has added privacy pro- The dynamic social Network graph shows in Figure 1.G = (G I1 , G I2 ). Figure 1(b) is compared to Figure 1(a), which increases the edge between node 2, node 6, deletes the edge between node 5 and node 8, and the edge between node 8 and node 11.

The Edge Weight Information of Ternary Group
Definition 2. The edge weight information of ternary group Defining a ternary group T = (i, j, x), i, j represents the node number in a social network diagram, x represents the weight value of the edge.x is 0 when there is no connection between nodes.T = (1, 2, 5) indicates that there is a side between node 1 and node 2, with a weighting value of 5.

Sensitivity
Definition 3. Sensitivity: q ∆ is the sensitivity of the query function, which is defined as follows: ( ) ( ) Data sets D 1 and D 2 differ by only one element.In this paper, we suppose two social network data sets G I1 and G I2 .There is only one different element between data sets G I1 and G I2 .Global sensitivity is set to the maximum difference weight that exists on the difference edge Δf = W max .The social network graph G I1 is initialized, and then the Markov clustering is carried out.The clustering result set V of node cluster is obtained, and then the weight information of each cluster is recorded.According to the order of clustering set, the weights are composed of ternary group

ε-Weight Vector
, , , , , , , , , n X x x x =  , according to the query sensitivity Δf and privacy budget parameters ε, constructing a noise vector with a Laplace distribution of length d <Lap(Δf/ε)> X .
( ) , P i is a weight vector satisfying ε-differential privacy.

Weight Information Loss of Graph Definition 5. Weight information loss rate of graph
There are ( ) ( ) . G′ is added the privacy protection.The loss rate of weight information due to the change of weight is: , , ′ is the value of weight which has added the privacy protection.W(G) is the sum of all edge weights of network graphs.

A Dynamic Social Network Data Publishing Algorithm Based on Differential Privacy
Applying static social network data privacy publishing algorithm directly to dynamic social network can cause high execution time and large information loss rate of graph structure.This paper makes an improvement on differential privacy network data publishing based on MCL, and designs a dynamic social network data publishing algorithm DDPA which satisfies the ε-difference privacy.
In order to introduce the algorithm flow of DDPA, the MDPA algorithm is decomposed into two parts that include algorithm 1 and algorithm 2.
Algorithm 1: Input the initial social network graph G, expansion parameter e, inflation parameter p, outputs the ε-weight vector of the initial graph G.
Algorithm 2: Output the ε-weight vector of the initial graph G, privacy budget parameters ε, output the privacy preserving graph G'.

Basic Idea of DDPA
The distribution of social network data has dynamic characteristics, and the graph structure is updated iteratively.DDPA algorithm is an improvement of privacy protection algorithm in static social network data publishing.MDPA algorithm adds noise to the whole network graph, but DDPA algorithm adds noise to the changed network edge weights.DDPA algorithm identifies the edge weight information that changes as the number of iterations increases, and adds the privacy protection budget that satisfies ε.Therefore, DDPA algorithm greatly reduces the execution cost of the algorithm and reduces the loss rate of weight information.

Basic Flow of DDPA
The algorithm steps are described as follows: Step 3 Execute algorithm 1, traverse G I+1 , build weights information ternary group T I+1 (i, j, x) and Vector X I+1 Step 4 Compare T I and T I+1 , recognize ternary group T c which belongs to modified edges, generate the weight vector X c corresponding to T c Step 5 Compare I T ′ and T I+1 , recognize ternary group T a which belongs to add edges, generate the weight vector X a corresponding to T a Step 6 Taking S i as sampling frequency, make X c to random sampling.Generating Laplace noise N c that satisfies differential privacy Step 7 Taking S i as sampling frequency, make X a to random sampling.Generating Laplace noise N a that satisfies differential privacy Step 8 Using X c instead of the changed edge information in the T I 's weight vector I X ′ , add the edge information increment to Step 9 According to the query sensitivity Δf and the privacy budget parameter ε, constructing a vector of Laplace distribution with length d: <Lap(Δf/ε)> X Step 10 Generating a vector Step 11 Distribute social Network graph 1 I G + ′ , which has protected in the I+1th Iteration

Privacy Analysis of DDPA
The DDPA algorithm of dynamic social network data release is the improvement of the social network differential privacy data publishing method based on the Markov clustering algorithm in the static social network.The MDPA algorithm has proved that it satisfies ε-difference privacy.This paper only needs to prove that after recognizing the change of the edge weight information, the ε-Weight vector DDPA (G I ) satisfies the differential privacy.
According to the definition of differential privacy, we suppose two dynamic social network data sets G I1 and G I2 .There is only one different element between data sets G I1 and G I2 .Given a privacy algorithm DDPA, Range (DDPA) is the range of DDPA.If any outputs of the DDPA algorithm on data sets G I1 and G I2 satisfy the following inequality, we can say that the DDPA algorithm satisfies ε-differential privacy.
( ) ( ) Pr DDPA e Pr DDPA Proof: Set i i p P ∈ , P i is the same as the X i dimension.From the conditional probability, , Pr Pr e

Experimental Setup
Experimental environment is: Intel(R) Core(TM) i5-4590 CPU @ 3.30 GHz 4.00 GB of Memory.The operating system is Microsoft Windows 7 ultimate.The programming languages are C++ and Matlab.The experimental data is Lesmis which is a weighted social network graph [14].Lesmis has 77 nodes and 254 sides.In order to reflect the dynamic characteristics of social network, this paper randomly deletes 5% nodes, then adds 5% nodes from the initial social network graph to form a new social network map.In the experimental scheme, the node of social network diagram is invariant, that is, regardless of the node reduction or increase.Only the change of side information is considered.

Experimental Result
The experiment of this paper contains three parts.The first part of the experi-Figure 3. Loss rate of weight information.

Experimental Comparison
This experiment is a comparison between the MDPA algorithm and the DDPA algorithm in the execution time and the weight information loss rate.The test results are shown in Figure 4.
In the experiment, the values of ε are 0.05, 0.1, 1 and 10.The experimental results in Figure 4(a) show that when the privacy budget parameter ε takes the same value, the execution time of the DDPA algorithm is significantly lower than the MDPA algorithm at the same iteration time, and as the number of iterations increases, the execution time of the DDPA algorithm is also less than the MDPA algorithm.The experimental results in Figure 4(b) show that when the privacy budget parameter ε takes the same value, the weight loss rate of the DDPA algorithm and the MDPA algorithm will increase gradually with the increase of iterative times.Because of the increase of the number of iterations, the Laplace noise satisfying the ε-difference privacy gradually increases, so the weight loss rate will increase correspondingly.However, the weight loss rate of the DDPA algorithm is always lower than the MDPA algorithm, which shows that the DDPA algorithm is superior to the MDPA algorithm in the dynamic social network.

Conclusion
Aiming at the improvement of privacy protection algorithm in static social network data publishing, a dynamic social network data publishing algorithm based on differential privacy is designed.This paper recognizes the edge weight information which changes with the increase of the number of iterations, adds the privacy protection budget satisfying ε, reduces the time cost of the algorithm, and guarantees the reduction of the loss rate of weight information.The limitation of this paper is that we only consider the increase or decrease of edge and the change of the edge weight.The change of the node makes the privacy protection budget more complicated.The future work is to deeply study the situation of the change of node.The method of this paper will enhance the degree of privacy protection and reduce the loss rate of weight information under the condition of satisfying the privacy budget.

Z
. P. Liu et al.DOI: 10.4236/jis.2017.84021329 Journal of Information Security network has great research value and application significance.The user's large number of personal privacy information may be leaked when social network data is analyzed and excavated.Social networks are evolving and changing that named dynamic social networks.The dynamic social network is concerned with the dynamic change caused by the change of time in the interaction between social members.The privacy strategy of the static social network data release usually cannot adapt to the dynamic development of social network efficiently.It has far-reaching theoretical significance and practical value in the field of information security and network space security.Existing privacy protection technologies include anonymous technology, data encryption technology, differen-

Figure 1 .
Figure 1.Dynamic social network.(a) Social Network for the first iteration G I1 .(b) Social Network for the second iteration G I2 .
Definition 4. ε-Weight vector Z. P. Liu et al.DOI: 10.4236/jis.2017.84021332 Journal of Information Security Input: Social Network graph G I in the Ith Iteration, Social Network graph I G ′ which has protected in the Ith Iteration, Social Network graph G I+1 in the I+1th Iteration, privacy budget parameter ε, expansion parameter e, Inflation parameter p; Output: Social Network graph 1 I G + ′ which has protected in the I+1th Iteration Step 1 Execute algorithm 1, traverse G I , build the weight information ternary group T I (i, j, x) and Vector X I Step 2 Execute algorithm 2, create a social network graph of privacy protection I G ′ , the weight information ternary group I T ′ and Vector

Figure 4 .
Figure 4. Contrast experiment between DDPA and MDPA.(a) Comparison experiment of DDPA and MDPA in Execution time.(b) Comparison experiment of DDPA and MDPA in Weight information loss rate.