Too high confidence for non-related messages

kumek · February 7, 2019, 3:46pm

Hi all,

I’m building a bot to respond to users’ questions, and I have an issue.

Rasa gives me high confidence for messages that are completely unrelated to my intents’ examples.

I have medical-related intents, but a message like “I like coffee” gets even higher confidence than related messages. Random-character messages like “laj jfias jjlas fe” also get high confidence.

Could anyone give me a hint on how to fix this? Where should I look for the bug?

This is my config:

language: "en"

pipeline:
- name: "nlp_spacy"
- name: "tokenizer_spacy"
- name: "intent_entity_featurizer_regex"
- name: "intent_featurizer_spacy"
- name: "ner_crf"
- name: "ner_synonyms"
- name: "intent_classifier_sklearn"
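
One way to narrow this down is to compare the full intent ranking for related and unrelated messages, not only the top confidence. Below is a minimal sketch, assuming the parse result is a dict with "intent" and "intent_ranking" keys, as Rasa NLU typically returns. The helper name `summarize` and the sample values are hypothetical, hand-written placeholders, not output from the model above.

```python
from typing import Any, Dict, Tuple


def summarize(parse_result: Dict[str, Any]) -> Tuple[str, float, float]:
    """Return (top intent, its confidence, margin over the runner-up)."""
    ranking = sorted(
        parse_result.get("intent_ranking", []),
        key=lambda item: item["confidence"],
        reverse=True,
    )
    if not ranking:
        # No ranking available: fall back to the single "intent" entry.
        top = parse_result["intent"]
        return top["name"], top["confidence"], top["confidence"]
    top = ranking[0]
    runner_up = ranking[1]["confidence"] if len(ranking) > 1 else 0.0
    return top["name"], top["confidence"], top["confidence"] - runner_up


# Hypothetical parse result (placeholder intent names and numbers).
# In practice this dict would come from the trained model's parse call.
example = {
    "text": "I like coffee",
    "intent": {"name": "intent_a", "confidence": 0.5},
    "intent_ranking": [
        {"name": "intent_a", "confidence": 0.5},
        {"name": "intent_b", "confidence": 0.25},
        {"name": "intent_c", "confidence": 0.25},
    ],
}

name, confidence, margin = summarize(example)
print(f"{example['text']!r}: {name} confidence={confidence} margin={margin}")
```

Running this over a handful of in-domain and out-of-domain messages shows whether unrelated input always lands on the same intent, which often points at the training data rather than the pipeline.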

MetcalfeTom · February 28, 2019, 10:00am

Hi @kumek,

What does your training data look like? Could you post the data file here?
