
Understanding Seq2Seq Neural Networks – Part 8: When Does the Decoder Stop?
In the previous article, we saw the translation being produced. But there is an issue: the decoder does not stop until it outputs an EOS token. So we plug the word "Vamos" into the decoder's embedding layer and unroll the two LSTM cells in each layer. Then we run the output values (the short-term memories, or hidden states) through the same fully connected layer. The next predicted token is EOS.

How the Decoder Works

This means we have translated the English sentence "let's go" into the correct Spanish sentence. For the decoder, the context vector, which is created by both layers of unrolled encoder LSTM cells, is used to initialize the LSTMs in the decoder. The input to the LSTMs comes from the output word embedding layer, which starts with EOS; after that, it uses whatever word was predicted by the output layer. In practice, the decoder keeps predicting words until it either predicts the EOS token or reaches some maximum output length. All of these weights and biases are trained using backpropagation through time.
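The stopping behavior described above can be sketched as a simple greedy decoding loop. This is a minimal illustration, not the article's actual model: the names (greedy_decode, predict_next) are hypothetical, and a toy lookup table stands in for the trained embedding, LSTM, and fully connected layers.

```python
EOS = "<EOS>"

def greedy_decode(predict_next, context, max_len=10):
    """Feed EOS in first, then keep feeding back each predicted word
    until the model emits EOS or max_len is reached."""
    output = []
    token, state = EOS, context  # decoding starts with the EOS token
    for _ in range(max_len):
        token, state = predict_next(token, state)
        if token == EOS:  # the decoder stops itself here
            break
        output.append(token)
    return output

# Toy stand-in for the trained decoder: "let's go" -> "vamos", then EOS.
def toy_predict(token, state):
    table = {EOS: "vamos", "vamos": EOS}
    return table[token], state

print(greedy_decode(toy_predict, None))  # ['vamos']
```

The max_len guard matters in practice: an undertrained decoder may never emit EOS, and without the cap the loop would run forever.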




