Rasa provides Ngram features component by default which able to do character level.
But I am looking for word level N-gram feature extraction.
How can I achieve that? Or Will I need to implement my own component for word level N-gram feature extraction. If yes, The below pipline looks correct or not?
The CountVectorsFeaturizer supports word n-grams as well. Take a look at all the available options here. In particular, you can use the (default) word analyzer and set your desired n-gram minimum and maximum lengths like this:
Does this answer your question? Also, the upcoming Rasa summit is a cool opportunity to meet Rasa contributors, creators and users, and discuss anything Rasa-related