Paper Notes: Sequence to Sequence Learning with Neural Networks

Reposted article. Posted 2017-12-23 16:37. Category: deep learning / paper notes

The overall idea is the same as the RNN encoder-decoder; the difference is that it is implemented with LSTMs.
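To make the encoder-decoder idea concrete, here is a minimal sketch in plain Python. It assumes a single-layer LSTM, random untrained weights, and toy one-hot vectors in place of word embeddings; the output softmax and training are left out. The helper names (`make_lstm`, `lstm_step`, `encode`, `decode`) are illustrative and do not come from the paper.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def make_lstm(input_size, hidden_size, rng):
    """Random weights for the four LSTM gates; each row acts on [x, h, 1]."""
    width = input_size + hidden_size + 1
    return [[rng.uniform(-0.1, 0.1) for _ in range(width)]
            for _ in range(4 * hidden_size)]


def lstm_step(weights, x, h, c):
    """One LSTM time step: returns the new hidden state and cell state."""
    size = len(h)
    z = list(x) + list(h) + [1.0]  # trailing 1.0 multiplies the bias column
    pre = [sum(w * v for w, v in zip(row, z)) for row in weights]
    i = [sigmoid(p) for p in pre[:size]]               # input gate
    f = [sigmoid(p) for p in pre[size:2 * size]]       # forget gate
    o = [sigmoid(p) for p in pre[2 * size:3 * size]]   # output gate
    g = [math.tanh(p) for p in pre[3 * size:]]         # candidate cell values
    c_new = [fj * cj + ij * gj for fj, cj, ij, gj in zip(f, c, i, g)]
    h_new = [oj * math.tanh(cj) for oj, cj in zip(o, c_new)]
    return h_new, c_new


def encode(enc_weights, source, hidden_size):
    """Read the whole source sequence; keep only the final (h, c) state."""
    h, c = [0.0] * hidden_size, [0.0] * hidden_size
    for x in source:
        h, c = lstm_step(enc_weights, x, h, c)
    return h, c


def decode(dec_weights, state, target_inputs):
    """Start from the encoder state and emit one hidden vector per step."""
    h, c = state
    outputs = []
    for y in target_inputs:
        h, c = lstm_step(dec_weights, y, h, c)
        outputs.append(h)
    return outputs


rng = random.Random(0)
INPUT_SIZE, HIDDEN_SIZE = 3, 4  # toy sizes, chosen only for illustration
enc_weights = make_lstm(INPUT_SIZE, HIDDEN_SIZE, rng)  # encoder parameters
dec_weights = make_lstm(INPUT_SIZE, HIDDEN_SIZE, rng)  # separate decoder parameters
source = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
target_inputs = [[0.0, 0.0, 1.0], [1.0, 0.0, 0.0]]
state = encode(enc_weights, source, HIDDEN_SIZE)
outputs = decode(dec_weights, state, target_inputs)
print(len(outputs), len(outputs[0]))  # prints: 2 4
```

The only thing passed from `encode` to `decode` is the fixed-size `(h, c)` pair, which is the defining property of this architecture. In a real model each decoder output would go through a softmax over the target vocabulary.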

The paper highlights three important points:

1) The encoder and the decoder are two separate LSTMs, each with its own parameters.

2) Deep LSTMs outperform shallow ones; the paper uses a 4-layer LSTM.

3) In practice, reversing the input (source) sentence before training gives better results; the target sentence is left in its original order. In the paper's words: "So for example, instead of mapping the sentence a,b,c to the sentence α,β,γ, the LSTM is asked to map c,b,a to α,β,γ, where α, β, γ is the translation of a, b, c. This way, a is in close proximity to α, b is fairly close to β, and so on, a fact that makes it easy for SGD to “establish communication” between the input and the output."
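The reversal in point 3 is a preprocessing step, and its effect on word distances can be checked with a small Python sketch. It assumes the toy sentence pair from the quote, a one-to-one alignment (a to α, b to β, c to γ), and no end-of-sentence token between source and target. The function names `reverse_source` and `time_lags` are made up for this illustration.

```python
def reverse_source(pairs):
    """Reverse each source sentence; leave the target sentence unchanged."""
    return [(list(reversed(src)), list(tgt)) for src, tgt in pairs]


def time_lags(src_tokens, tgt_tokens, alignment):
    """Steps between each source token and its aligned target token
    when the model reads src_tokens followed by tgt_tokens.

    Assumes every token is unique, so list.index finds the right position.
    """
    stream = list(src_tokens) + list(tgt_tokens)
    return {s: stream.index(t) - stream.index(s) for s, t in alignment.items()}


pairs = [(["a", "b", "c"], ["α", "β", "γ"])]
alignment = {"a": "α", "b": "β", "c": "γ"}  # assumed word-by-word translation

src, tgt = pairs[0]
rev_src, rev_tgt = reverse_source(pairs)[0]

print(time_lags(src, tgt, alignment))          # {'a': 3, 'b': 3, 'c': 3}
print(time_lags(rev_src, rev_tgt, alignment))  # {'a': 1, 'b': 3, 'c': 5}
```

In this toy case the average lag stays at 3 either way, but after reversal a sits right next to α, so the first words of the output have a very short path back to the inputs they depend on.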

