Algorithm 1. Framework of proposed P2P

Require: previous model M t 1 , a batch dataset D = { x i , y i } i = 1 n , the probability threshold p, the number of iterators k, learning rate α .

Ensure: A translated dataset D t r a n s , a private domain dataset D p r i , a public domain dataset D p u b .

1. Freeze M t 1

2. Compute the predicted probability distribution for D : P = Softmax ( M t 1 ( x ) )

3. Divide D : D p u b = { ( x i , y i ) | P y i p } , D t r a n s = { ( x i , y i ) | P y i < p }

4. For x D p r i , initialize x * = x + δ with a small random noise δ

5. for i t e r = 1 to k do

6. δ = x * [ L cross-entropy ( θ t 1 ; x * , y ) ]

7. x * = x * α 1 δ 2

8. end for

9. D t r a n s = { ( x * , y ) | y ¯ ( x * , θ t 1 ) = y }

10. Return D t r a n s , D p r i , D p u b