What is the Rasa NLU compatible output format for Tokens?

Jasmin25 · July 19, 2019, 2:11pm

From the tutorial here, I can see output format for entities: https://blog.rasa.com/enhancing-rasa-nlu-with-custom-components

I want to fetch tokens from SpacyTokenizer (the previous component in nlu pipeline), operate on them and return to the SpacyFeaturizer(the next component in nlu pipeline).

This is what my process() function looks like:

def process(self, message, **kwargs):
        """Retrieve the tokens of the new message, pass it to the classifier
            and append prediction results to the message class."""
        
        tokens = [t.text for t in message.get("tokens")]
        corrected_tokens = self.preprocessing(tokens)

        tokens = self.convert_to_rasa(corrected_tokens)

        message.set("tokens", [token], add_to_output=True)

What should be in my convert_to_rasa function?

Thanks!

Jasmin25 · July 24, 2019, 9:55am

Any help here?

nurakib · March 4, 2020, 10:31am

Facing similar issue, can anyone respond?

Topic		Replies	Views
Questions of Rasa with Spacy Rasa Open Source	2	285	November 23, 2023
NLU not predicting entities separated by the '/' character in the new version of Rasa. Why? Rasa Open Source	3	442	June 11, 2020
Understand synonym, pipeline component, entities and nlu extraction Rasa Open Source	3	1260	September 11, 2021
NLU Pipeline Debugging --- how to get and use an instance from a Rasa NLU Pipeline Rasa Open Source	3	805	May 16, 2022
Lemmatization & Punctuations Rasa Open Source	9	3131	September 25, 2019

What is the Rasa NLU compatible output format for Tokens?

Related Topics