項目總結四：神經風格遷移項目（Art generation with Neural Style Transfer）

本文轉載自查看原文 2018-04-22 11:30 1263 項目總結

1、項目介紹

神經風格轉換 (NST) 是深部學習中最有趣的技術之一。它合並兩個圖像, 即內容圖像 C（content image）和樣式圖像S（style image）, 以生成圖像 G（generated image）。生成的圖像 G 將圖像 C 的內容與圖像S的樣式組合在一起。

2、模型

利用遷移學習的技巧，模型采用預訓練的VGG19網絡。預訓練的模型來自 MatConvNet. http://www.vlfeat.org/matconvnet/pretrained/ 。模型結構如下：

（1）模型結構示例圖：

（2）本項目用的VGG19網絡的結構

{'input': <tf.Variable 'Variable:0' shape=(1, 300, 400, 3) dtype=float32_ref>,
 'conv1_1': <tf.Tensor 'Relu:0' shape=(1, 300, 400, 64) dtype=float32>, 
 'conv1_2': <tf.Tensor 'Relu_1:0' shape=(1, 300, 400, 64) dtype=float32>,
 'avgpool1': <tf.Tensor 'AvgPool:0' shape=(1, 150, 200, 64) dtype=float32>,
 'conv2_1': <tf.Tensor 'Relu_2:0' shape=(1, 150, 200, 128) dtype=float32>, 
 'conv2_2': <tf.Tensor 'Relu_3:0' shape=(1, 150, 200, 128) dtype=float32>, 
 'avgpool2': <tf.Tensor 'AvgPool_1:0' shape=(1, 75, 100, 128) dtype=float32>, 
 'conv3_1': <tf.Tensor 'Relu_4:0' shape=(1, 75, 100, 256) dtype=float32>, 
 'conv3_2': <tf.Tensor 'Relu_5:0' shape=(1, 75, 100, 256) dtype=float32>, 
 'conv3_3': <tf.Tensor 'Relu_6:0' shape=(1, 75, 100, 256) dtype=float32>, 
 'conv3_4': <tf.Tensor 'Relu_7:0' shape=(1, 75, 100, 256) dtype=float32>,
 'avgpool3': <tf.Tensor 'AvgPool_2:0' shape=(1, 38, 50, 256) dtype=float32>,
 'conv4_1': <tf.Tensor 'Relu_8:0' shape=(1, 38, 50, 512) dtype=float32>, 
 'conv4_2': <tf.Tensor 'Relu_9:0' shape=(1, 38, 50, 512) dtype=float32>, 
 'conv4_3': <tf.Tensor 'Relu_10:0' shape=(1, 38, 50, 512) dtype=float32>, 
 'conv4_4': <tf.Tensor 'Relu_11:0' shape=(1, 38, 50, 512) dtype=float32>, 
 'avgpool4': <tf.Tensor 'AvgPool_3:0' shape=(1, 19, 25, 512) dtype=float32>, 
 'conv5_1': <tf.Tensor 'Relu_12:0' shape=(1, 19, 25, 512) dtype=float32>, 
 'conv5_2': <tf.Tensor 'Relu_13:0' shape=(1, 19, 25, 512) dtype=float32>,
 'conv5_3': <tf.Tensor 'Relu_14:0' shape=(1, 19, 25, 512) dtype=float32>,
 'conv5_4': <tf.Tensor 'Relu_15:0' shape=(1, 19, 25, 512) dtype=float32>,
 'avgpool5': <tf.Tensor 'AvgPool_4:0' shape=(1, 10, 13, 512) dtype=float32>}

3、成本函數

（1）內容代價函數

首先把圖片由3D volume展開為2D matrix，如下圖：

計算內容代價函數。分別以G和S兩圖片作為輸入時，如果神經網絡某一層的激活值相似，那么就意味着兩個圖片的內容相似。

（2）風格代價函數

首先計算某一層的Gram矩陣：

計算風格代價函數。分別以G和S兩圖片作為輸入時，如果神經網絡某一層的各個通道之間激活值相關系數高，那么就意味着兩個圖片的內容相似。

實際上，如果你對各層都使用風格代價函數，會讓結果變得更好。計算公式如下：

把內容代價函數和風格代價函數組合到一起，就得到了代價函數：

4、模型優化算法與訓練目標

# define optimizer (1 line)
optimizer = tf.train.AdamOptimizer(2.0)
 
# define train_step (1 line)
train_step = optimizer.minimize(J)

5、輸入輸出數據

輸入數據：content_image、style_image、generated_image
輸出數據：generated_image

6、總結

Neural Style Transfer is an algorithm that given a content image C and a style image S can generate an artistic image
It uses representations (hidden layer activations) based on a pretrained ConvNet.
The content cost function is computed using one hidden layer's activations.
The style cost function for one layer is computed using the Gram matrix of that layer's activations. The overall style cost function is obtained using several hidden layers.
Optimizing the total cost function results in synthesizing new images.

免責聲明！

本站轉載的文章為個人學習借鑒使用，本站對版權不負任何法律責任。如果侵犯了您的隱私權益，請聯系本站郵箱yoyou2525@163.com刪除。

猜您在找 Art: Neural Style Transfer 神經風格轉換Neural Style Transfer a review 神經風格轉換 (Neural-Style-Transfer-Papers) fast neural style transfer圖像風格遷移基於tensorflow實現 Gitlab批量遷移項目 PyCharm Python遷移項目圖像風格轉換（Style Transfer | 風格遷移綜述） pytorch例子學習——NEURAL TRANSFER USING PYTORCH神經遷移風格遷移論文理解--A Neural Algorithm of Artistic Style 阿里雲NAS文件遷移項目實踐