Adding text preprocessing component to Rasa

OmarGr · May 27, 2019, 5:57pm

Hi, I would like to add a text preprocessing custom component in the beginning of the Rasa pipeline. Mainly, my config file will look like;

pipeline:

name: "preprocessing"
name: “nlp_spacy” model: “fr”
name: “spacy_tokenizer”
name: “intent_entity_featurizer_regex”
name: “ner_crf”

Could I add it without giving a value to “provides”? because I don’t want to change the script of nlp_spacy component. Any suggestion is highly appreciated!

Thanks!

MetcalfeTom · May 28, 2019, 10:25am

Hi @OmarGr,

What will the preprocessor do? This should work fine with an empty list, i.e. provides = list()

OmarGr · May 29, 2019, 12:39am

Hi @MetcalfeTom, Thank you for your reply, I really appreciate it! However, it doesn’t solve my problem!

Preprocessing: detect emojies and add space between them and the adjacent word. My Pipeline is the following:

language: “fr”

pipeline:

name: “preprocessing_component.preprocessor”
name: “nlp_spacy” model: “fr”
name: “tokenizer_spacy”
name: “intent_entity_featurizer_regex”
name: “ner_crf”

My preprocessing component will feed the nlp_spacy with its output “sentence”, the script is below:

class preprocessor(Component): “”“preprocessor”""

name = "proprocessing_component"
provides = {"sentence"}
defaults = {}

. . . .

def train(
    self, training_data: TrainingData, config: RasaNLUModelConfig, **kwargs: Any
) -> None:

    for example in training_data.training_examples:
        example.set("sentence", self.add_space(example.text))

def process(self, message: Message, **kwargs: Any) -> None:

    message.set("sentence", self.add_space(message.text))

How can I add this component without changing nlp_spacy? Your suggestions are highly appreciated!

Thanks, Omar

ShaileshSridhar2403 · February 7, 2020, 8:07am

Hi @OmarGr,

I’m facing a similar problem right now. Did you manage to solve this issue? Any help would be greatly appreciated.

Thanks!

ermarkar · March 9, 2021, 6:04am

you got the solution ?

Topic		Replies	Views
Writing a custom component to preprocess text Rasa Open Source	0	664	July 27, 2022
Rasa com Rasa Open Source	13	1573	April 24, 2020
Training with custom components through HTTP API Rasa Open Source	2	541	April 18, 2019
Preprocessing input user message Rasa Open Source	0	326	April 15, 2021
Specify component input in RASA NLU Rasa Open Source	1	698	June 18, 2019

Adding text preprocessing component to Rasa

Related topics