How to integrate own Custom tokenizer

Sanju7820 · March 8, 2022, 4:56am

hello guys I have transliteration Hindi dateset.With the inbuilt pipeline of RASA,the accuracy of the trained model is not up-to the mark.I have built a custom Tokenizer .How to replace the inbuilt Tokenizer with my own custom Tokenizer.

stephens · March 8, 2022, 7:15pm

There’s a discussion in the docs here and you’ll find the source code for several example tokenizers here.

dasomx · November 15, 2022, 2:57am

After creating a custom tokenizer, what is the next step? I wonder if I can add a custom tokenizer to my current rasa project. I can’t find the folder name rasa in my rasa project. Where should I put the file in my project to override the NLU components?

Sanjukta.bs · February 12, 2024, 1:34pm

Is there any solution for this?

stephens · March 15, 2024, 3:30pm

You’ll find docs here. Once you’ve created your custom component, such as a tokenizer, include it in the config.yml as shown in this docs example. Do a train and run with --debug and make sure you see the component being used.

Topic		Replies	Views
How to fix custom component problem? Rasa Open Source	2	721	May 26, 2019
How to use own model in pipeline? Rasa Open Source	24	4312	March 1, 2021
Creating Custom Components (Input and Output) Rasa Open Source	2	1063	January 3, 2021
Enhance rasa with a pre-trained classification model Rasa Open Source	4	2090	October 22, 2019
Have my own language tokenizer and specific classifiers Rasa Open Source	1	561	January 26, 2019

How to integrate own Custom tokenizer

Related topics