DIET Architecture: Transformer Output of CLS and Intent Similarity

setopaisen · April 30, 2020, 4:50pm

Hello, I’ve watched episodes 1 and 2 of the DIET algorithm whiteboard series on YouTube. I tried to follow the explanation, especially the part about the similarity between the Transformer output for CLS and the intent labels.

The video explains that the output of the Transformer block (including CLS) is a large numeric vector [256], which is then embedded to calculate its similarity with the intent labels. So I have a few questions:

1. Can the Transformer block process a one-hot encoded vector? I ask because there is an Input Embedding on both the encoder and decoder layers.

2. Could you explain what kind of embedding is applied to the intent labels? Does it embed every training example that has the target intent? For example, the Play Games intent has 10 training sentences.
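To make my current understanding concrete, here is a minimal sketch of what I think the video describes. It is not Rasa's actual code: the weights are random stand-ins for trained layers, the intent names and the shared embedding size of 20 are my own assumptions, and I am assuming one embedding per intent label rather than per training sentence, which is exactly the part I would like confirmed.

```python
import numpy as np

rng = np.random.default_rng(0)

intents = ["play_games", "greet", "goodbye"]  # hypothetical intent labels
transformer_dim = 256  # size of the CLS output mentioned in the video
embed_dim = 20         # assumed size of the shared embedding space

# Stand-in for the transformer output at the CLS position for one sentence.
cls_output = rng.normal(size=transformer_dim)

# Assumption: one one-hot row per intent label, not per training sentence.
label_one_hot = np.eye(len(intents))

# Randomly initialised stand-ins for the two trainable embedding layers.
W_cls = rng.normal(size=(transformer_dim, embed_dim))
W_label = rng.normal(size=(len(intents), embed_dim))

cls_embedded = cls_output @ W_cls         # shape (embed_dim,)
label_embedded = label_one_hot @ W_label  # shape (num_intents, embed_dim)

# Dot-product similarity between the sentence and every intent label.
similarities = label_embedded @ cls_embedded  # shape (num_intents,)
predicted = intents[int(np.argmax(similarities))]
print(similarities.shape, predicted)
```

If this is right, the 10 training sentences for Play Games would all be pulled towards the same single label embedding during training, instead of each sentence getting its own label embedding.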

I’m very excited about Rasa. It has a great architecture, and the team does an amazing job of explaining what is behind it.

Any answers or clues would be much appreciated. Thanks!

setopaisen · May 2, 2020, 1:24pm

Hello, any ideas on this problem?
