spaCy pretrained models break chatbot NLU capabilities

Hi

Setting up the pipeline to use pretrained_embeddings_spacy, whatever the language setting, breaks our bot's NLU abilities.

config.yml file content:

language: fr
pipeline: pretrained_embeddings_spacy

policies:
  - name: MemoizationPolicy
  - name: KerasPolicy
  - name: MappingPolicy
  - name: FormPolicy
  - name: "FallbackPolicy"
    nlu_threshold: 0.7 # Min confidence needed to accept an NLU prediction
    core_threshold: 0.5 # Min confidence needed to accept an action prediction from Rasa Core
    fallback_action_name: "action_incompréhension"
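
For context, my understanding from the Rasa 1.x docs is that this pipeline template is just a shortcut for a list of components along these lines (the exact defaults may differ between versions, so treat this as a sketch):

language: fr
pipeline:
  - name: "SpacyNLP"
  - name: "SpacyTokenizer"
  - name: "SpacyFeaturizer"
  - name: "RegexFeaturizer"
  - name: "CRFEntityExtractor"
  - name: "EntitySynonymMapper"
  - name: "SklearnIntentClassifier"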

Is there anything else to set up?

Thanks for your help.

z.

How many training examples do you have?

Fewer than 1000, as I understand it.

rasa.nlu.training_data.training_data  - Training data stats:
        - intent examples: 214 (10 distinct intents)
        - Number of response examples: 0 (0 distinct response)
        - entity examples: 0 (0 distinct entities)

I would still try out the supervised_embeddings pipeline and/or stay with the spaCy pipeline but change single components.

Indeed, supervised_embeddings yields satisfactory results; I was just wondering about the spaCy pretrained model's impact on overall accuracy and performance.

However, what exactly are “single components”? Our proprietary content in the nlu.md file?

No. For example, supervised_embeddings is just a shortcut for this pipeline:

pipeline:
  - name: "WhitespaceTokenizer"
  - name: "RegexFeaturizer"
  - name: "CRFEntityExtractor"
  - name: "EntitySynonymMapper"
  - name: "CountVectorsFeaturizer"
  - name: "CountVectorsFeaturizer"
    analyzer: "char_wb"
    min_ngram: 1
    max_ngram: 4
  - name: "EmbeddingIntentClassifier"

Every entry is a single component of the NLU pipeline and influences the NLU result. You can play around with these single components to achieve better results.
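
For example, one variant you could try (just a sketch I haven't benchmarked for French; swapping in the spaCy tokenizer here is my own suggestion, not an official template) is to keep the supervised_embeddings setup but replace the whitespace tokenizer with spaCy's tokenizer:

language: fr
pipeline:
  - name: "SpacyNLP"
  - name: "SpacyTokenizer"
  - name: "RegexFeaturizer"
  - name: "CRFEntityExtractor"
  - name: "EntitySynonymMapper"
  - name: "CountVectorsFeaturizer"
  - name: "CountVectorsFeaturizer"
    analyzer: "char_wb"
    min_ngram: 1
    max_ngram: 4
  - name: "EmbeddingIntentClassifier"

Whether that helps depends on your data; the point is just that each component can be replaced or tuned independently.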

Ah, alright, I’ll take a look. Thank you very much for your help.