1. Principles of LSTM & GRU
https://blog.csdn.net/jerr__y/article/details/58598296
https://github.com/starflyyy/Gated-Recurrent-Unit-GRU
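As a compact reference for what the two links above walk through, the standard GRU update equations are given below (Cho et al. formulation; note that PyTorch's nn.GRU swaps the roles of z_t and 1 - z_t in the last line):

```latex
\begin{aligned}
z_t &= \sigma(W_z x_t + U_z h_{t-1} + b_z) && \text{update gate} \\
r_t &= \sigma(W_r x_t + U_r h_{t-1} + b_r) && \text{reset gate} \\
\tilde{h}_t &= \tanh\!\big(W_h x_t + U_h (r_t \odot h_{t-1}) + b_h\big) && \text{candidate state} \\
h_t &= (1 - z_t) \odot h_{t-1} + z_t \odot \tilde{h}_t && \text{new hidden state}
\end{aligned}
```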
2. Multi-layer LSTM
PyTorch's RNN modules take a num_layers argument: while parameters are shared across time steps, each layer is a distinct cell, so num_layers is effectively the number of hidden layers. The cells are stacked in series, much like the layers of an MLP; this is the StackedRNN, as shown below.
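A minimal sketch of this stacking behavior (the sizes are made up for illustration): with num_layers=2, layer 1 has its own parameters and consumes layer 0's hidden states at every time step.

```python
import torch
import torch.nn as nn

# Two stacked LSTM layers: layer 1 reads layer 0's h_t at each step,
# the way MLP layers feed each other; parameters are NOT shared across layers.
lstm = nn.LSTM(input_size=10, hidden_size=20, num_layers=2)

x = torch.randn(5, 3, 10)          # (seq_len, batch, input_size)
output, (h_n, c_n) = lstm(x)

print(output.shape)                # torch.Size([5, 3, 20]): top layer, every t
print(h_n.shape)                   # torch.Size([2, 3, 20]): last t, every layer
```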
3. Bidirectional RNN
To study further: https://blog.csdn.net/jojozhangju/article/details/51982254
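The gist, in a runnable sketch (sizes are again illustrative): a bidirectional RNN runs one pass left-to-right and one right-to-left with separate parameters, and concatenates the two hidden states at each step.

```python
import torch
import torch.nn as nn

# bidirectional=True doubles num_directions: the forward and backward
# passes each keep their own parameters and hidden states.
birnn = nn.LSTM(input_size=10, hidden_size=20, bidirectional=True)

x = torch.randn(5, 3, 10)          # (seq_len, batch, input_size)
output, (h_n, c_n) = birnn(x)

print(output.shape)                # torch.Size([5, 3, 40]): hidden_size * 2
print(h_n.shape)                   # torch.Size([2, 3, 20]): num_layers * 2 directions
```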
4. Implementing and modifying LSTM & RNN
Source code analysis: https://zhuanlan.zhihu.com/p/63638656
Worked example: https://zhuanlan.zhihu.com/p/32103001; on top of it you can rewrite the RNN cell yourself (see the sketch below): https://github.com/huyingxi/new-LSTM-Cell
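What "rewriting the RNN cell yourself" boils down to, as a hedged sketch (NaiveLSTMCell is a hypothetical name, not the linked repo's code): implement the gate equations in a Module and unroll it over time.

```python
import torch
import torch.nn as nn

class NaiveLSTMCell(nn.Module):
    """Hand-rolled LSTM cell following the standard gate equations."""
    def __init__(self, input_size, hidden_size):
        super().__init__()
        # One linear map emits all four gates at once: i, f, g, o.
        self.gates = nn.Linear(input_size + hidden_size, 4 * hidden_size)

    def forward(self, x, state):
        h, c = state
        i, f, g, o = self.gates(torch.cat([x, h], dim=1)).chunk(4, dim=1)
        i, f, o = torch.sigmoid(i), torch.sigmoid(f), torch.sigmoid(o)
        c = f * c + i * torch.tanh(g)     # cell state update
        h = o * torch.tanh(c)             # new hidden state
        return h, c

# Unrolling the cell by hand reproduces nn.LSTM with num_layers=1.
cell = NaiveLSTMCell(10, 20)
x = torch.randn(5, 3, 10)                 # (seq_len, batch, input_size)
h, c = torch.zeros(3, 20), torch.zeros(3, 20)
outs = []
for t in range(x.size(0)):
    h, c = cell(x[t], (h, c))
    outs.append(h)
output = torch.stack(outs)                # (5, 3, 20)
```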
An RNN in PyTorch returns both output and hidden; the difference is illustrated in the figure below (LSTM case). Specifically:
Outputs: output, (h_n, c_n)
- output (seq_len, batch, hidden_size * num_directions): tensor containing the output features (h_t) from the last layer of the RNN, for each t. If a torch.nn.utils.rnn.PackedSequence has been given as the input, the output will also be a packed sequence.
- h_n (num_layers * num_directions, batch, hidden_size): tensor containing the hidden state for t=seq_len
- c_n (num_layers * num_directions, batch, hidden_size): tensor containing the cell state for t=seq_len
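A quick check of these shapes, assuming a unidirectional 2-layer LSTM: output holds the top layer's h_t for every t, while h_n holds every layer's h at the final t, so the two views coincide at output[-1] == h_n[-1].

```python
import torch
import torch.nn as nn

lstm = nn.LSTM(input_size=10, hidden_size=20, num_layers=2)
x = torch.randn(5, 3, 10)                 # (seq_len, batch, input_size)
output, (h_n, c_n) = lstm(x)

print(output.shape)                       # torch.Size([5, 3, 20])
print(h_n.shape, c_n.shape)               # torch.Size([2, 3, 20]) each

# output's last time step equals h_n's last layer (unidirectional case).
print(torch.allclose(output[-1], h_n[-1]))  # True
```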
One more implementation: https://github.com/emadRad/lstm-gru-pytorch/, though its multi-layer LSTM is not quite correct.