LSTM & GRU: Principles and PyTorch Implementation


1. Principles of LSTM & GRU

https://blog.csdn.net/jerr__y/article/details/58598296

https://github.com/starflyyy/Gated-Recurrent-Unit-GRU
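For quick reference, the standard per-step updates are roughly as follows (my own summary of the usual formulations, not quoted from the linked posts; the GRU is written in PyTorch's nn.GRU convention, and some papers swap the roles of z_t):

```latex
% LSTM (standard formulation)
\begin{aligned}
i_t &= \sigma(W_i x_t + U_i h_{t-1} + b_i) \\
f_t &= \sigma(W_f x_t + U_f h_{t-1} + b_f) \\
o_t &= \sigma(W_o x_t + U_o h_{t-1} + b_o) \\
\tilde{c}_t &= \tanh(W_c x_t + U_c h_{t-1} + b_c) \\
c_t &= f_t \odot c_{t-1} + i_t \odot \tilde{c}_t \\
h_t &= o_t \odot \tanh(c_t)
\end{aligned}

% GRU (PyTorch nn.GRU convention)
\begin{aligned}
r_t &= \sigma(W_r x_t + U_r h_{t-1} + b_r) \\
z_t &= \sigma(W_z x_t + U_z h_{t-1} + b_z) \\
n_t &= \tanh(W_n x_t + b_{in} + r_t \odot (U_n h_{t-1} + b_{hn})) \\
h_t &= (1 - z_t) \odot n_t + z_t \odot h_{t-1}
\end{aligned}
```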

2. Multi-layer LSTM

PyTorch's RNN modules take a num_layers argument: although parameters are shared across time steps within a layer, the network still has a different cell for each layer, so num_layers is effectively the number of hidden layers. The cells are stacked in series, much like the layers of an MLP, which is exactly a stacked RNN (StackedRNN); the structure is sketched in code below.
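A minimal sketch of what num_layers does in practice (the tensor sizes here are arbitrary, chosen only to illustrate the shapes):

```python
import torch
import torch.nn as nn

seq_len, batch, input_size, hidden_size = 5, 3, 10, 20

# num_layers=2 stacks two LSTM layers: layer 0 reads the input sequence,
# layer 1 reads layer 0's hidden states at every time step (like an MLP over layers).
lstm = nn.LSTM(input_size, hidden_size, num_layers=2)

x = torch.randn(seq_len, batch, input_size)   # default layout: (seq_len, batch, input_size)
output, (h_n, c_n) = lstm(x)

print(output.shape)  # torch.Size([5, 3, 20]) -> top layer's h_t at every step
print(h_n.shape)     # torch.Size([2, 3, 20]) -> each layer's h at the last step
```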

 

3. Bidirectional RNN

To be studied further: https://blog.csdn.net/jojozhangju/article/details/51982254
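A minimal sketch of the PyTorch side (assuming the standard bidirectional setup: one RNN runs over the sequence forward, one backward, and their features are concatenated):

```python
import torch
import torch.nn as nn

rnn = nn.GRU(input_size=10, hidden_size=20, num_layers=1, bidirectional=True)

x = torch.randn(5, 3, 10)   # (seq_len, batch, input_size)
output, h_n = rnn(x)

print(output.shape)  # torch.Size([5, 3, 40]): hidden_size * 2, forward and backward concatenated
print(h_n.shape)     # torch.Size([2, 3, 20]): (num_layers * num_directions, batch, hidden_size)
```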

4. Implementing and modifying LSTM & RNN

Source-code analysis: https://zhuanlan.zhihu.com/p/63638656

Example walkthrough: https://zhuanlan.zhihu.com/p/32103001; on top of that you can rewrite the RNN cell yourself: https://github.com/huyingxi/new-LSTM-Cell (a minimal cell sketch follows).
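As a starting point for such a rewrite, here is a minimal from-scratch cell with the same call convention as nn.LSTMCell (the class name MyLSTMCell is my own; this is a sketch of the standard gate equations, not code from the linked repo):

```python
import torch
import torch.nn as nn

class MyLSTMCell(nn.Module):
    """Drop-in style LSTM cell; custom gate logic would go in forward()."""
    def __init__(self, input_size, hidden_size):
        super().__init__()
        self.hidden_size = hidden_size
        # Compute all four gates with one matmul each, as PyTorch does internally.
        self.x2h = nn.Linear(input_size, 4 * hidden_size)
        self.h2h = nn.Linear(hidden_size, 4 * hidden_size)

    def forward(self, x, state):
        h, c = state
        gates = self.x2h(x) + self.h2h(h)
        i, f, g, o = gates.chunk(4, dim=1)
        i, f, o = torch.sigmoid(i), torch.sigmoid(f), torch.sigmoid(o)
        g = torch.tanh(g)
        c_next = f * c + i * g               # cell state update
        h_next = o * torch.tanh(c_next)      # hidden state
        return h_next, c_next
```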

In PyTorch, an RNN returns both output and hidden; the difference (for LSTM) is shown below and in the sketch after the list.

Specifically:

Outputs: output, (h_n, c_n)

  • output (seq_len, batch, hidden_size * num_directions): tensor containing the output features (h_t) from the last layer of the RNN, for each t. If a torch.nn.utils.rnn.PackedSequence has been given as the input, the output will also be a packed sequence.
  • h_n (num_layers * num_directions, batch, hidden_size): tensor containing the hidden state for t=seq_len
  • c_n (num_layers * num_directions, batch, hidden_size): tensor containing the cell state for t=seq_len
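A quick way to see the difference (a sketch with arbitrary sizes): output holds the top layer's hidden state at every time step, while h_n holds every layer's hidden state at the last time step, so for a unidirectional LSTM the two overlap at output[-1] == h_n[-1].

```python
import torch
import torch.nn as nn

lstm = nn.LSTM(input_size=10, hidden_size=20, num_layers=2)
x = torch.randn(5, 3, 10)                      # (seq_len, batch, input_size)
output, (h_n, c_n) = lstm(x)

print(output.shape)                            # torch.Size([5, 3, 20])
print(h_n.shape)                               # torch.Size([2, 3, 20])
print(torch.allclose(output[-1], h_n[-1]))     # True: last step of the top layer appears in both
```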

One reference implementation, although its multi-layer LSTM is not quite right: https://github.com/emadRad/lstm-gru-pytorch/ (see the stacking sketch below).
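For comparison, here is a sketch of the wiring a hand-rolled multi-layer LSTM needs (assuming standard stacked-LSTM semantics; run_stacked is a hypothetical helper, not code from that repo): each layer keeps its own (h, c) state, and at every time step layer l consumes layer l-1's output from the same step.

```python
import torch
import torch.nn as nn

def run_stacked(cells, x):
    """Run a list of LSTMCell-like modules as a stacked LSTM."""
    seq_len, batch, _ = x.shape
    states = [(torch.zeros(batch, c.hidden_size), torch.zeros(batch, c.hidden_size))
              for c in cells]
    outputs = []
    for t in range(seq_len):
        inp = x[t]
        for l, cell in enumerate(cells):
            h, c = cell(inp, states[l])   # each layer has its own (h, c) state
            states[l] = (h, c)
            inp = h                       # this layer's h feeds the next layer at step t
        outputs.append(inp)               # top layer's h at step t
    return torch.stack(outputs), states

cells = nn.ModuleList([nn.LSTMCell(10, 20), nn.LSTMCell(20, 20)])
out, states = run_stacked(cells, torch.randn(5, 3, 10))
print(out.shape)   # torch.Size([5, 3, 20])
```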

