VERY low confidence with DIETClassifier

kaladin · July 27, 2020, 1:35pm

Rasa version: Rasa 1.10.8

Hi, I wanted to try out some defferent configs for my chatbot just like in this video. I tried configs from video on Sara chatbot and the results were the same as in the video, so my rasa instalation is probably ok.

My chatbot has around 30 intents with 10-20 examples per intent. No entities.

All configs from video that I used are avaible here

First I tried config-mega-basic and it gave me pretty acceptable results.

Then I tried diet-light config and it looks like chabot is just guessing.

And the last config that I tried for my chatbot is diet-heavy, which is better then diet-light but still not even close to config-mega-basic

Summary:

Has anyone had such a problem before? Do I really need 100+ examples per intent to make DIET work?

tyd · July 28, 2020, 7:49am

Hi @kaladin. Cool post! The number of examples per intent you need to make DIET work depends on what performance you expect, how you divide your assistant into intents, and the training data you provide. 10-20 examples per intent for 30 intents seems like a really small amount to me though.

I think only you can answer if you need 100+ examples per intent. Following Conversation-Driven Development will allow you to grow a dataset until your assistant performs as well as you desire. There are some good tips in 10 Best Practices for Designing NLU Training Data that should also help along with Rasa X, which makes it easy to collect and annotate those training examples.

koaning · August 3, 2020, 7:58am

Hi Kaladin,

one regret I have with the original benchmark that you list there is that the epochs are … rather small. It might be better to push them to 100 epochs to ensure that DIET converged (I just updated the gist). Should you ever run the benchmarks again I’d love to hear about it.

What Ty said is something to keep in mind too. I’m assuming that currently running a model against data that you came up with but odds are that your users are going to behave differently.

Starting out with 20-30 examples per intent can be fine (you need to start somewhere) but eventually you’ll want to collect more data. Preferably from real users. Not just because “more data is better” but also because your examples will cover more ground, language wise.

And as Ty mentioned, Rasa X should be a great startinging point for this. It’s free and it integrates with Rasa from the get-go.

RUI-LONG · February 18, 2022, 2:18am

I love this graph !

Topic		Replies	Views
Training on large intent examples Rasa Open Source	5	1363	March 17, 2022
Large number of entities per example & cross validation expectations Rasa Open Source	0	776	April 5, 2019
Does DIET architecture work effective with a larger dataset? Rasa Open Source	1	425	June 2, 2020
Rasa NLU intent recognition models in Portuguese Rasa Open Source	2	274	July 18, 2023
DIETClassifier detecting no entities Rasa Open Source	0	164	February 11, 2024

VERY low confidence with DIETClassifier

Related topics