Once you have a trained model, how do you pass a message through all of the steps in a pipeline up to a specific component? I’ve tried pulling that component out of `Interpreter.pipeline`, saving it as a separate object, and from there running both its `process` and `partially_process` methods, but nothing seems to work.
Background
I’ve trained an NLU model and, using the `Interpreter` object, I’d like to debug the individual components of the pipeline. By debug, I mean I would like to run a message through all of the steps before a given component and then through the component I’m interested in.
Example
As a simple example, if I use the basic pre-trained Spacy pipeline, I would like to pull out the second component, the `SpacyTokenizer`, to see how my message is tokenized. This way I can inspect what is actually being passed to the `CRFEntityExtractor`.
When I run:
```python
interpret = Interpreter.load(os.path.join(model_dir, 'model_name'))
tokenizer = interpret.pipeline[1]
tokenizer.process('Hello, how are you?')
```
It doesn’t convert the text into a spaCy `Doc`, but instead keeps it as a `str`.
I’ve also tried:
```python
tokenizer.prepare_partial_process(tokenizer.partial_processing_pipeline, tokenizer.partial_processing_context)
tokenizer.partially_process('Hello, how are you?')
```
But that still doesn’t work.
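To make the behavior I’m after concrete, here is a toy sketch in plain Python. The classes below (`Message`, `LowercaseNormalizer`, `WhitespaceTokenizer`) are stand-ins I made up for illustration, not the real Rasa components; the point is just the `process_up_to` helper that runs a message through every component up to a chosen index:

```python
class Message:
    """Minimal message container, loosely modeled on an NLU message."""
    def __init__(self, text):
        self.text = text
        self.data = {}

class LowercaseNormalizer:
    """Stand-in component: lowercases the message text in place."""
    def process(self, message):
        message.text = message.text.lower()

class WhitespaceTokenizer:
    """Stand-in component: splits the text into tokens."""
    def process(self, message):
        message.data['tokens'] = message.text.split()

def process_up_to(pipeline, message, stop_index):
    """Run `message` through pipeline[0..stop_index] inclusive and return it."""
    for component in pipeline[:stop_index + 1]:
        component.process(message)
    return message

pipeline = [LowercaseNormalizer(), WhitespaceTokenizer()]
msg = process_up_to(pipeline, Message('Hello, how are you?'), 1)
print(msg.data['tokens'])  # tokens produced by the second component
```

That is, I want each component to mutate a shared message object, and I want to be able to stop after any component and inspect the state it left behind.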
I have other use cases for pulling out the components from a pipeline. For instance, I do post-processing on a specific intent to match extracted text against a predefined list. I want to pull the featurizer out of my pipeline so I can map both the extracted entities and the predefined list into an embedding space; that way I can use a better string-matching algorithm based on vector similarity.
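The matching step I have in mind looks roughly like the sketch below. The embedding vectors here are made up for illustration; in the real pipeline they would come from the featurizer component:

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def best_match(entity_vec, candidates):
    """Return the predefined-list item whose embedding is closest to the entity's.

    `candidates` maps each list item to its embedding vector; in practice
    those vectors would be produced by the pipeline's featurizer.
    """
    return max(candidates, key=lambda name: cosine_similarity(entity_vec, candidates[name]))

# Made-up vectors, for illustration only.
predefined = {
    'new york': [0.9, 0.1, 0.0],
    'los angeles': [0.1, 0.8, 0.2],
}
extracted = [0.85, 0.15, 0.05]  # e.g. the featurized form of "NYC"
print(best_match(extracted, predefined))  # -> 'new york'
```

So the only missing piece is being able to call the featurizer on its own to produce those vectors.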
Any and all help will be much appreciated.
Thank you!