Question about embedding policy architecture

jyoungrok · December 13, 2018, 10:08am

Hello. I read paper about REDP (https://arxiv.org/pdf/1811.11707.pdf). According to section 3 RNN part, the output of lstm is fed to embedding layer and then the sum of it and system attention vector is used as dialogue state embedding.

(The output of this cell is fed to another embedding layer to create an embedding of the cell output for the current time step. The sum of this embedded cell output and system attention vector is used as the dialogue state embedding.)

However, in the Figure 2, the output of lstm is added to system attention vector and then fed to embedding layer, which is the purple box on the far right.

Am i misunderstanding something?

Thanks in advance.

Ghostvv · December 13, 2018, 10:27am

you’re right. It is a mistake on the scheme

jyoungrok · December 17, 2018, 12:55am

ah… okay thanks!

bamba518 · May 5, 2019, 9:49am

What are written to the user and system memory in the photos? Are these just the embedding vectors for system actions and user inputs in chronological order per each story? Then the attention mechanism produces another vector to identify which parts to ignore and which parts to pay attention to?

Ghostvv · May 6, 2019, 7:18am

yes

Topic		Replies	Views
Embedding Policy Results Rasa Open Source	15	3402	July 19, 2019
What's the future goal of embedding policy? Rasa Open Source	3	794	February 6, 2019
Source Code of Embedding Policy (TEDP) explained? Rasa Open Source	0	582	December 18, 2019
Understanding the usage of LSTM in keras_policy Rasa Open Source	1	904	January 14, 2019
LSTM model policy Rasa Open Source	1	607	February 13, 2019

Question about embedding policy architecture

Related topics