How to add an attention layer to a seq2seq model in Keras
Based on this article, I wrote this model:

from keras.layers import Input, LSTM, Dense
from keras.models import Model

# Encoder: return the full output sequence plus the final states
enc_in = Input(shape=(None, in_alphabet_len))
lstm = LSTM(lstm_dim, return_sequences=True, return_state=True, use_bias=False)
enc_out, h, c = lstm(enc_in)

# Decoder: initialised with the encoder's final states (teacher forcing)
dec_in = Input(shape=(None, in_alphabet_len))
decoder, _, _ = LSTM(decoder_dim, return_sequences=True, return_state=True)(dec_in, initial_state=[h, c])
decoder = Dense(units=in_alphabet_len, activation='softmax')(decoder)
model = Model([enc_in, dec_in], decoder)

How can I add an attention layer to this model, before the decoder?

Inanimate answered 8/11, 2017 at 9:25 Comment(1)
Here is a simple way to add attention: https://mcmap.net/q/586055/-how-to-add-attention-layer-to-a-bi-lstm – Roncesvalles

You can use the keras-self-attention repo:

  1. You will need to pip install keras-self-attention.
  2. Import the layer: from keras_self_attention import SeqSelfAttention
    • If you want to use tf.keras rather than keras, add os.environ['TF_KERAS'] = '1' before the import.
    • If you are using plain keras, make sure to omit that flag, as setting it will cause inconsistencies.
  3. Since you are using the Keras functional API, apply the layer to the encoder outputs:

    # Note: att still has to be wired into the decoder; see the fuller sketch below.
    enc_out, h, c = lstm(enc_in)                   # the encoder LSTM from your model
    att = SeqSelfAttention()(enc_out)              # self-attention over the encoder outputs
    dec_in = Input(shape=(None, in_alphabet_len))  # the decoder input stays its own Input, not a layer applied to att
    

    I hope this answers your question, and helps future readers.
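
For reference, here is a minimal end-to-end sketch of how SeqSelfAttention could be wired into the question's model. It assumes keras-self-attention is installed and reuses the question's names (lstm_dim, decoder_dim, in_alphabet_len). Because the decoder in the question only receives the encoder's final states, an attended encoder sequence would have nowhere to flow, so this sketch (a variation on the snippet above, not taken verbatim from it) applies the self-attention to the decoder's output sequence instead, just before the softmax:

    import os
    os.environ['TF_KERAS'] = '1'  # only when using tf.keras; omit for plain keras

    from tensorflow.keras.layers import Input, LSTM, Dense
    from tensorflow.keras.models import Model
    from keras_self_attention import SeqSelfAttention

    # Encoder: its final states initialise the decoder, as in the question.
    enc_in = Input(shape=(None, in_alphabet_len))
    enc_out, h, c = LSTM(lstm_dim, return_sequences=True, return_state=True, use_bias=False)(enc_in)

    # Decoder LSTM (teacher forcing), initialised with the encoder states.
    dec_in = Input(shape=(None, in_alphabet_len))
    dec_seq, _, _ = LSTM(decoder_dim, return_sequences=True, return_state=True)(dec_in, initial_state=[h, c])

    # Self-attention over the decoder sequence; the output has the same shape as the input.
    # history_only=True (if your keras-self-attention version supports it) keeps each step
    # from attending to future target tokens.
    dec_seq = SeqSelfAttention(attention_activation='sigmoid', history_only=True)(dec_seq)

    dec_out = Dense(units=in_alphabet_len, activation='softmax')(dec_seq)
    model = Model([enc_in, dec_in], dec_out)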

Freddie answered 23/10, 2019 at 14:8 Comment(3)
Well, this is self-attention. For seq2seq you will normally want attention between the encoder and decoder states. – Femur
So, what do you suggest? – Freddie
I don't know if there is a Keras wrapper for Bahdanau or Luong attention, but there is a neat TensorFlow 2.0 tutorial for seq2seq translation with attention: tensorflow.org/tutorials/text/nmt_with_attention – Femur
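
Following up on that: newer TensorFlow releases do ship built-in wrappers, tf.keras.layers.Attention (Luong-style dot-product) and tf.keras.layers.AdditiveAttention (Bahdanau-style). Below is a hedged sketch of encoder-decoder attention for the model in the question, assuming TF 2.x, the question's variable names, and decoder_dim == lstm_dim (which the initial_state handoff already requires):

    from tensorflow.keras.layers import Input, LSTM, Dense, Concatenate, AdditiveAttention
    from tensorflow.keras.models import Model

    # Encoder: keep the full output sequence for attention, plus the final states.
    enc_in = Input(shape=(None, in_alphabet_len))
    enc_out, h, c = LSTM(lstm_dim, return_sequences=True, return_state=True, use_bias=False)(enc_in)

    # Decoder LSTM (teacher forcing), initialised with the encoder states.
    dec_in = Input(shape=(None, in_alphabet_len))
    dec_seq, _, _ = LSTM(decoder_dim, return_sequences=True, return_state=True)(dec_in, initial_state=[h, c])

    # Bahdanau-style attention between decoder and encoder states:
    # each decoder step queries the encoder outputs; context is (batch, dec_steps, lstm_dim).
    context = AdditiveAttention()([dec_seq, enc_out])

    # Combine each decoder step with its attention context before the softmax.
    dec_out = Concatenate(axis=-1)([dec_seq, context])
    dec_out = Dense(units=in_alphabet_len, activation='softmax')(dec_out)

    model = Model([enc_in, dec_in], dec_out)

At inference time you would still need a separate step-by-step decoding loop (as in the linked tutorial); this graph assumes teacher forcing during training.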
