Embedding Policy Results

Hi Everyone!

Our new paper about the embedding policy (aka REDP) is now available: [1811.11707] Few-Shot Generalization Across Dialogue Tasks - we’ll present it at the NeurIPS conversational AI workshop next week.

There’s a blog post here that explains what the paper is about, and why it matters to Rasa developers. Let me know what you think!

Have you been using the Embedding Policy? We’d love to hear about your results, so let’s start a thread here.

:v:


This policy looks very promising :+1:

I tried to train my model, but it needs some serious processing power to even finish one epoch. Is this expected?

I am using the same hyperparams as in your paper.

For the new Embedding Policy you need to train stories with those chitchats and corrections. So where is the advantage/improvement compared to a normal LSTM? Is it that you need to write far fewer such uncooperative stories, because the attention layer learns not to pay attention to this part and will generalise to stories it wasn't trained on?


Hi! Great feature :slight_smile:

Did you benchmark the training time? I'm currently training a model with this policy on a GTX 1080 Ti (12 GB) + 32 GB RAM + a 32-core CPU, and each epoch takes about 10 min …

I'm using:

policies:
  - name: EmbeddingPolicy
    epochs: 2000
    attn_shift_range: 5

EDIT: I'll answer my own question: the EmbeddingPolicy should be used with --augmentation 0


yes that’s a good description :slight_smile: the point of the policy is that it can learn to re-use those patterns from just a few examples

yes! the attention mechanisms definitely require more compute power to train. You can also switch off one (or both) of the attentions to trade a bit of generalization power for compute time.
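If you want to try that, the two attentions can be toggled in the policy config. A sketch (I'm going from memory on the flag names attn_before_rnn / attn_after_rnn, so double-check them against the EmbeddingPolicy defaults in your rasa_core version):

policies:
  - name: EmbeddingPolicy
    epochs: 2000
    attn_shift_range: 5
    # flag names assumed from memory - verify for your version
    attn_before_rnn: false  # attention over the user input, switched off to save compute
    attn_after_rnn: true    # attention over previous system actions, kept on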

@amn41 its computation is much greater than a normal LSTM's, and we still have to write the same uncooperative stories. I don't see how it gets by with fewer stories.

What are the advantages of using it? It gives the same result in my case: LSTM at 300 epochs vs. Embedding at 2000 epochs.

Is this supposed to take 30 minutes to train for 2000 epochs??

@adrianhumphrey111 has already given the answer: the EmbeddingPolicy should be used with --augmentation 0. Add this to your command while training.

Can you please give me an example of both the command line way and the Python file way of doing that?

python -m rasa_core.train -s data/stories.md -d domain.yml -o models/dialogue -c policy.yml --augmentation 0
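For the Python way, something like this should be roughly equivalent (a sketch against the rasa_core Python API; I'm assuming EmbeddingPolicy accepts its hyperparameters as keyword arguments, so double-check against your version):

from rasa_core.agent import Agent
from rasa_core.policies.embedding_policy import EmbeddingPolicy

# same policy settings as in policy.yml
agent = Agent("domain.yml",
              policies=[EmbeddingPolicy(epochs=2000, attn_shift_range=5)])

# augmentation_factor=0 is the Python-side equivalent of --augmentation 0
training_data = agent.load_data("data/stories.md", augmentation_factor=0)

agent.train(training_data)
agent.persist("models/dialogue")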

Running this command causes this output:

/usr/local/lib/python3.6/site-packages/pykwalify/core.py:99: UnsafeLoaderWarning: The default 'Loader' for 'load(stream)' without further arguments can be unsafe. Use 'load(stream, Loader=ruamel.yaml.Loader)' explicitly if that is OK. Alternatively include the following in your code:

  import warnings
  warnings.simplefilter('ignore', ruamel.yaml.error.UnsafeLoaderWarning)

In most other cases you should consider using 'safe_load(stream)'
  data = yaml.load(stream)
Processed Story Blocks: 100%|██████████████████████████████████████████████████████████████████████████████| 26/26 [00:00<00:00, 2419.24it/s, # trackers=16]
2019-01-07 08:22:41 INFO     rasa_core.agent  - Model directory models/dialogue/ exists and contains old model files. All files will be overwritten.
2019-01-07 08:22:41 INFO     rasa_core.agent  - Persisted model to '/app/kiddiecommute 2/models/dialogue'

I do not see it going over any epochs. Inside my models/dialogue folder, I only have these files:

domain.json
domain.yml
policy_metadata.json

I do not see any model or policy files.

Create a file policy.yml and paste this in:

policies:
  - name: EmbeddingPolicy
    epochs: 2000
    attn_shift_range: 5


That is exactly what I have already. Could I see what the output should look like?

Did anyone see improvements by using this?

I think it just improves as you train on more data.