Rasa classifies random input as intents with high probability

Jakub · July 30, 2021, 12:24pm

Hello, I have trained my model using pipeline:

name: “DucklingHTTPExtractor” url: “http://localhost:8000” dimensions: [“duration”]
name: WhitespaceTokenizer
name: RegexFeaturizer
name: LexicalSyntacticFeaturizer
name: CountVectorsFeaturizer
name: CountVectorsFeaturizer analyzer: char_wb min_ngram: 1 max_ngram: 4
name: DIETClassifier epochs: 100
name: EntitySynonymMapper
name: ResponseSelector epochs: 100
name: FallbackClassifier threshold: 0.3 ambiguity_threshold: 0.1

However random words like ‘asdf’, ‘qwerty’ and even single letters are classified as intents (‘greet’, ‘neutral’ respectively). I could increase FallbackClassifier threshold, but some examples have confidence over 0.9. I’m using Rasa 2.8. How to deal with it?

nik202 · July 30, 2021, 2:02pm

@Jakub Hi! Can you please share some example or file or screenshot?

@Jakub Why you are using ambiguity_threshold: 0.1 inside FallbackClassifier or share the link from where you get this idea?

@Jakub Please share complete config.yml or update the above one.

thorty · July 30, 2021, 2:20pm

Hi! Same on my site. No Matter what kind of input. There is always a intent recognition. Eg “123456”, “dfjjaspfj” or something like “i want pizza”.

I’m looking forward for some help. Thanks!

Using diffrent versions with docker:

rasa/rasa:main-spacy-de
rasa/rasa:2.8.1-spacy-de

pipeline: language: de

pipeline:
    # # No configuration for the NLU pipeline was provided. The following default pipeline was used to train your model.
    # # If you'd like to customize it, uncomment and adjust the pipeline.
    # # See https://rasa.com/docs/rasa/tuning-your-model for more information.
    - name: SpacyNLP
      model: de_core_news_sm
    - name: SpacyTokenizer
    - name: SpacyFeaturizer
    - name: RegexFeaturizer
    - name: LexicalSyntacticFeaturizer
    - name: CountVectorsFeaturizer
    - name: CountVectorsFeaturizer
      analyzer: char_wb
      min_ngram: 1
      max_ngram: 4
    - name: DIETClassifier
      epochs: 100
      constrain_similarities: true
    - name: EntitySynonymMapper
    - name: FallbackClassifier
      threshold: 0.3
      ambiguity_threshold: 0.1

nlu.yml: version: “2.0”

nlu:

- intent: greet

  examples: |

    - hey

    - hallo

    - hi

    - hallo du

    - guten morgen

    - guten abend

    - morgen

    - guten tag

- intent: goodbye

  examples: |

    - tschüss

    - wiederhören

    - ciao

    - bis dann

    - bye

result:

{

    "text": "12345",

    "intent": {

        "id": 8761605359927853060,

        "name": "goodbye",

        "confidence": 0.9780691862106323

    },

    "entities": [],

    "intent_ranking": [

        {

            "id": 8761605359927853060,

            "name": "goodbye",

            "confidence": 0.9780691862106323

        },

        {

            "id": -3994034142759590561,

            "name": "greet",

            "confidence": 0.02193082869052887

        }

    ]

}

nik202 · July 30, 2021, 2:25pm

@thorty Means whatever input i.e “I want pizza” , “I need drinks” do such examples are in your nlu.yml file?; it’s giving you goodbye output?

thorty · July 30, 2021, 2:48pm

No they are not into my nlu.yml as examples. In this case I’m expecting a result like: intent: none. But what I’ m getting here is something like intent: goodbye with a conf of over 90.

my nlu.yml is exactly like in the post above. the pipeline also. and the result is an example with this config.

thank you @nik202 !

nik202 · July 30, 2021, 3:05pm

@thorty Why this? any significance? @thorty Link please, I guess its used for two-stage fallback not for FallbackClassifier

thorty · July 30, 2021, 3:12pm

@Nik202 No. It’s from the doc or from the example of the init pipeline. Should I use another threshold? I will try this out later.

rctatman · July 30, 2021, 4:52pm

@thorty All your inputs will be given an intent. If you have very high confidence and a lot of incorrect classifications it’s often the case that this is due to your data not being a good representation of what your assistant is seeing in production.

You should create an out of scope intent if you want to be able to capture things your assistant can’t do. This page goes over how to do it: Fallback and Human Handoff

thorty · July 31, 2021, 6:29am

@Nik202: Thanks, but this does not change enything.

@rctatman: Thanks for your explanation and the Link to the right Step in Documentation. This helps a lot in understanding rasa.

In my case I do not really know the utterances from users. So I will start with a small set of examples and train the NLU after going live with real-world examples. To handle the “non” trained utterances the right way it would be better to get no intent rather then the wrong intent. Especially when utterances are non-sense like in my example. Is there any chance to achieve this?

nik202 · July 31, 2021, 12:27pm

@thorty ok

nik202 · July 31, 2021, 12:29pm

@Jakub Any progress about your error?

Jakub · August 2, 2021, 8:40am

@nik202 sorry for late response. I removed ambiguity_threshold: 0.1 from my config file (the one I sent in first message is my whole config.yml, I have default policies), but there’s only a slight diffrence (a little bit more of inputs are classified as nlu_fallback). I upload examples and neutral intent below.

example1 example2 example3 example4

thorty · August 2, 2021, 1:51pm

Short Update from my side:

when I also use the KeywordIntentClassifier in my pipline the NLU response as follows:

"intent": {
    "name": null,
    "confidence": 0.0
},

pipeline - name: KeywordIntentClassifier

But I think that makes the DIET Classifier with all its advantages useless. ?!?

thorty · August 4, 2021, 2:15pm

Short Update:

Like @rctatman explained. With more trainingdata I become better confidence values for my utterences. Also for “nonsense” imput like “askjfnqiurz”.

SowmyaBalam · October 12, 2021, 6:24am

hi, i am also facing the same issue, for whatever nonsense query "abcdefghijkl’ it is going to greet with high confidence of 0.99 in diet classifier. can anybody help me resolve this issue.

nik202 · October 12, 2021, 7:30am

@SowmyaBalam delete the older trained model first, some times it help and did you mention the default fallback, please share some supporting files, so that we can see.

SowmyaBalam · October 12, 2021, 7:34am

@nik202 this is the config file. i am using:

language: en

pipeline:

# No configuration for the NLU pipeline was provided. The following default pipeline was used to train your model.

# If you’d like to customize it, uncomment and adjust the pipeline.

# See Tuning Your NLU Model for more information.

name: WhitespaceTokenizer
name: RegexFeaturizer
name: LexicalSyntacticFeaturizer
name: CountVectorsFeaturizer
name: CountVectorsFeaturizer analyzer: char_wb min_ngram: 1 max_ngram: 4
name: DIETClassifier epochs: 300 evaluate_on_number_of_examples: 80 evaluate_every_number_of_epochs: 5 tensorboard_log_directory: “tensorboard” tensorboard_log_level: “epoch” checkpoint_model: True constrain_similarities: True
name: EntitySynonymMapper

- name: ResponseSelector

epochs: 100

constrain_similarities: true

- name: FallbackClassifier

threshold: 0.3

ambiguity_threshold: 0.1

Configuration for Rasa Core.

Policies

policies:

# No configuration for policies was provided. The following default policies were used to train your model.

# If you’d like to customize them, uncomment and adjust the policies.

# See Policies for more information.

name: MemoizationPolicy
name: TEDPolicy #max_history: 5 epochs: 300 evaluate_on_number_of_examples: 80 evaluate_every_number_of_epochs: 5 tensorboard_log_directory: “tensorboard” tensorboard_log_level: “epoch” checkpoint_model: True constrain_similarities: True
name: RulePolicy

nik202 · October 12, 2021, 7:36am

@SowmyaBalam can you please format the code using “</>” or “”" “”" ?

SowmyaBalam · October 12, 2021, 7:40am