Is there a doc on how to interpret pipeline comparison results?

ganeshv · March 11, 2021, 1:53pm

I tried a comparison of two NLU-only pipelines, but I’m not sure where to look so that I can interpret the results. Attached is a sample from one of my tests

nlu_model_comparison_graph.pdf (25.1 KB)

Does this mean that the spacy config gives you both right and wrong answers both at high confidence ratings over the other? Are there other things to understand from this graph?

SamS · March 12, 2021, 10:27am

Hey @ganeshv, I think there are two graphs that got plotted over each other by mistake. Could you, please, report this as a bug on Github and tag @dakshvar22 on the issue?

ganeshv · March 12, 2021, 12:17pm

Hello @SamS will do. Can this file still be interpreted or we’d have to wait till the bug is fixed?

SamS · March 15, 2021, 7:56am

@ganeshv in all honesty, I don’t know. My guess would be that both of the graphs capture legit and true information, it’s just that the line graph lost information about the scale of its axes…

ganeshv · March 15, 2021, 12:14pm

Hello @SamS - here’s the error reported.

emanuelvieira · May 13, 2021, 10:00am

Hello, I have the same problem in interpreting my generated graph, I generated the graph below for comparison but I know exactly what it means and how to read it. this is my graph:

Please help me with this I thank you in advance

SamS · May 13, 2021, 11:52am

@jjuzl I can see the bug report for this weird plotting being tagged as future. Would it make sense to prioritise it soon? Right now, I can’t really help @emanuelvieira or others read the graph because it’s simply broken…

Topic		Replies	Views
Interpret pipeline comparison results Rasa Open Source	1	201	May 14, 2021
Evaluating a model - clarifications needed Rasa Open Source	0	406	February 5, 2020
Graphics' overlap Rasa Open Source	0	202	June 30, 2021
NLU comparison does not plot results Rasa Open Source	3	591	April 7, 2020
New Language support docs inconsistency Rasa Open Source	4	530	August 12, 2020

Is there a doc on how to interpret pipeline comparison results?

Related topics