Pre-Trained vectors for tensorflow pipeline

parthsharma1996 · September 1, 2018, 10:01am

We recently switched over to the tensorflow_embedding pipeline (mostly due to multiple intent support and less memory consumption).

However (due to the architecture of the pipeline), the model doesn’t generalize over new examples at all.

In many cases the models returns an incorrect intent with very high confidence and the correct intent (along with many others) is returned as zero.

Is it possible to have a structured way of solving this problem?

I tired the pipeline specified in this thread

but it didn’t work and I got the same results.

souvikg10 · September 2, 2018, 6:21pm

what type of SpaCy’s model are you using?? small, medium or large and in which language?

adrianhumphrey111 · September 2, 2018, 6:40pm

hey man, could you take a look at this for me?

http://forum.rasa.com/t/agent-handle-message-no-longer-works/815

parthsharma1996 · September 4, 2018, 8:32am

I’m using English. How does one find out whether small,medium or large is being used?

souvikg10 · September 4, 2018, 8:34am

Are you using the spacy model installed using rasa ? how did you install the spacy backend?

parthsharma1996 · September 4, 2018, 8:44am

I used the command python -m spacy download en to download it. Running it right now seems to download the small model.

souvikg10 · September 4, 2018, 8:57am

Probably why you don’t a see a difference in performance. The small model of Spacy does not contain a lot of word vectors. Tensorflow is non linear so my assumption is that it should fit better compared to SVM(sklearn pipeline)

But if you want to use pre-trained vectors, maybe try the larger models and see the difference

parthsharma1996 · September 4, 2018, 9:04am

Okay. I will try that and let you know. But since I have the small models not installed, after installing the large model, how do I ensure that rasa uses the large model and not the small one?

parthsharma1996 · September 4, 2018, 9:06am

nevermind, got it!

Topic		Replies	Views
Decision about using a pre-trained words embeddings or not Getting Started with Rasa	3	169	June 3, 2020
Improve Rasa NLU model Rasa Open Source	5	2168	October 15, 2019
spaCy pretrained models break chatbot NLU capacities Rasa Open Source	6	780	October 16, 2019
New Language support docs inconsistency Rasa Open Source	4	539	August 12, 2020
Rasa NLU without Rasa Core Getting Started with Rasa confidence	4	194	August 23, 2019

Pre-Trained vectors for tensorflow pipeline

Related topics