I ran a comparison of two NLU-only pipelines, but I'm not sure how to interpret the results. Attached is a sample graph from one of my tests:
nlu_model_comparison_graph.pdf (25.1 KB)
Does this mean that the spacy config gives both right and wrong answers at higher confidence than the other config? Is there anything else I should take away from this graph?