FallbackClassifier threshold advice

laboratory · April 29, 2021, 4:37pm

In my config.yml, the pipeline entry for FallbackClassifier looks like this:

- name: FallbackClassifier
  threshold: 0.7
  ambiguity_threshold: 0.1

But while testing the bot with rasa shell nlu, I find this threshold is too high for some intents. Some of my intents score around 0.6xxx, and a few fall slightly below 0.6.

My questions are:

Is it advisable to reduce the threshold value for my FallbackClassifier?
If yes, what’s the trade-off of reducing the threshold value?
If no, how can I get my intent prediction scores to 0.7 and above?
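
To make the trade-off in the second question concrete, here is a minimal Python sketch of the documented FallbackClassifier rule: a fallback is predicted when the top confidence is below threshold, or when the gap between the top two confidences is smaller than ambiguity_threshold. This is not Rasa's actual implementation. The function name would_fall_back, the intent names, and the confidence values are all made up for illustration.

```python
from typing import Dict, List


def would_fall_back(
    ranking: List[Dict],
    threshold: float = 0.7,
    ambiguity_threshold: float = 0.1,
) -> bool:
    """Sketch of the fallback rule; not Rasa's own code."""
    if not ranking:
        # Assumption of this sketch: no prediction at all counts as a fallback.
        return True
    ordered = sorted(ranking, key=lambda r: r["confidence"], reverse=True)
    top = ordered[0]["confidence"]
    # Rule 1: the best intent is not confident enough.
    if top < threshold:
        return True
    # Rule 2: the two best intents are too close to call.
    if len(ordered) > 1 and top - ordered[1]["confidence"] < ambiguity_threshold:
        return True
    return False


# Hypothetical predictions, invented for this example.
examples = {
    "clear": [
        {"name": "greet", "confidence": 0.92},
        {"name": "goodbye", "confidence": 0.03},
    ],
    "borderline": [
        {"name": "check_balance", "confidence": 0.64},
        {"name": "transfer_money", "confidence": 0.20},
    ],
    "ambiguous": [
        {"name": "check_balance", "confidence": 0.72},
        {"name": "transfer_money", "confidence": 0.66},
    ],
}

for threshold in (0.7, 0.6):
    for label, ranking in examples.items():
        print(threshold, label, would_fall_back(ranking, threshold=threshold))
```

With these invented numbers, lowering the threshold from 0.7 to 0.6 lets the "borderline" message through, while the "ambiguous" one still falls back because of ambiguity_threshold. The cost is that any wrong prediction scoring between 0.6 and 0.7 would now also be accepted instead of triggering a fallback.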

Thanks in anticipation.

koaning · April 30, 2021, 7:52am

Hi Charles,

This is still an open research question. It depends a lot on your data, which makes it hard to give specific advice for your situation. In my experience it takes a lot of trial and error to get right. That said, there is some general advice that you might find helpful.

1. We recently introduced a change to our DIET algorithm which should make the “confidence” output more reliable. There’s a full explanation of the technical details on our algorithm whiteboard, but in general we now recommend setting the model_confidence parameter to linear_norm and constrain_similarities to True.
2. The threshold parameter determines when a fallback is triggered, but only to an extent. We shouldn’t forget that the data you train on influences the pipeline too. There’s something to be said for addressing this issue by adding more training data. If we have more relevant data to learn from, odds are that the pipeline will also be able to quantify its confidence better.
3. Another way of dealing with “early triggering of fallbacks” is to not send the user a general message (like "could you rephrase?") but to trigger a custom action instead. The custom action could load buttons representing the top 3 intents for the user to press. That way, the fallback becomes more of a minor inconvenience for the user. There’s an example of such an action here.
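
As a rough illustration of the button idea, the sketch below turns an intent ranking into button payloads for the top 3 intents. It is not the example linked above. The helper build_intent_buttons, the INTENT_TITLES mapping, and the sample ranking are hypothetical. It assumes the ranking is a list of dicts with "name" and "confidence" keys, and that the nlu_fallback entry should be skipped. In a rasa_sdk custom action, the ranking would presumably come from tracker.latest_message.get("intent_ranking", []), and the result would be passed to dispatcher.utter_message(text=..., buttons=...).

```python
from typing import Dict, List

FALLBACK_INTENT = "nlu_fallback"

# Hypothetical mapping from intent names to user-facing button titles.
INTENT_TITLES = {
    "check_balance": "Check my balance",
    "transfer_money": "Transfer money",
    "block_card": "Block my card",
}


def build_intent_buttons(intent_ranking: List[Dict], top_n: int = 3) -> List[Dict]:
    """Build button dicts for the top_n non-fallback intents."""
    # Skip the fallback intent itself; it is not a useful suggestion.
    candidates = [i for i in intent_ranking if i["name"] != FALLBACK_INTENT]
    candidates.sort(key=lambda i: i["confidence"], reverse=True)
    return [
        {
            # Fall back to the raw intent name when no title is defined.
            "title": INTENT_TITLES.get(i["name"], i["name"]),
            # A payload of "/<intent>" makes the button trigger that intent.
            "payload": f"/{i['name']}",
        }
        for i in candidates[:top_n]
    ]


# Hypothetical ranking, invented for this example.
sample_ranking = [
    {"name": "nlu_fallback", "confidence": 1.0},
    {"name": "check_balance", "confidence": 0.64},
    {"name": "transfer_money", "confidence": 0.20},
    {"name": "block_card", "confidence": 0.09},
    {"name": "greet", "confidence": 0.04},
]

buttons = build_intent_buttons(sample_ranking)
for button in buttons:
    print(button)
```

Keeping the button construction in a plain function makes it easy to test without a running action server.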

laboratory · April 30, 2021, 8:02am

Thank you. This makes a lot of sense.

Following the Rasa documentation, I am already using your first suggestion. Your second and third suggestions also make a lot of sense.

I will start with suggestion 2 for now and consider suggestion 3 in the future, if necessary.

koaning · April 30, 2021, 8:22am

Happy to hear it.

Should you have a tangible example of something strange happening, do let me know here. The research team appreciates feedback.
