Analyse intent / entity distribution

anyone · May 7, 2021, 9:23am

Hello everyone! For big nlu files I need a way to check which entities are present in which intents and how often they occur. Ideally there is also a way to see which words are mapped to the entities. I didn’t find any solutions for this.

Do you now any tools / scripts that I can use for these questions? Or do I have to write them from scratch?

Best regards and thank you for your help!

stephens · May 13, 2021, 8:47pm

Hi Mike,

Do you want to do this against your training data or user conversations?

Greg

anyone · May 14, 2021, 6:39am

Hi Greg,

I want to do this against my training data. I want to compare different approaches of labeling the data and see which performs the best. But to make sure that my data actually follows the intended approach I need a way to check the data and catch any data that was wrongly labeled.

Mike

Polaris000 · November 8, 2021, 5:41am

Hi @anyone. I’m interested in doing this exact thing. Any updates on this? Thanks.

Polaris000 · November 8, 2021, 5:43am

@stephens does rasa provide functionality for this? Thanks.

stephens · November 9, 2021, 3:32pm

I want to compare different approaches of labeling the data and see which performs the best

For this part of your question, you can use rasa test nlu.

I need a way to check which entities are present in which intents and how often they occur.

I don’t know of an existing solution for this. Maybe one of @koaning’s projects?

I need a way to check the data and catch any data that was wrongly labeled.

I don’t think rasa data validate checks for conflicting entity labels but this would be useful. Might want to submit an enhancement request for this.

koaning · November 10, 2021, 7:00am

I wrote doubtlab which is a general tool, somewhat more aimed at the general scikit-learn ecosystem, which can help you find bad labels in your dataset. This may help with intents, entities not so much at the moment. You can also check out cleanlab for this task.

Topic		Replies	Views
Rasa test nlu: test if all entities are labeled chorrectly within a sentence Rasa Open Source testing	2	782	February 15, 2021
What controls available entities/intents for labeling in rasa x? Rasa Open Source	2	912	September 23, 2019
NLU only detecting entities explicitly present in training data Rasa Open Source	17	2783	August 8, 2021
Interactive learning for Rasa NLU entity recognition Rasa Open Source	3	345	May 28, 2020
Choosing NLU pipeline Rasa Open Source	6	1318	December 16, 2019

Analyse intent / entity distribution

Related topics