Rasa provides an n-gram feature component by default, but it works at the character level.
I am looking for word-level n-gram feature extraction instead.
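To make clear what I mean by word-level n-grams, here is a minimal plain-Python sketch. It is not a Rasa component, and the function name `word_ngrams` is only illustrative; it assumes simple whitespace tokenization.

```python
def word_ngrams(text, n=3):
    """Return the word-level n-grams of a whitespace-tokenized string."""
    tokens = text.split()  # whitespace tokenization (assumption for this sketch)
    # Slide a window of n tokens over the sentence and join each window.
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


print(word_ngrams("book a table for two", n=3))
# ['book a table', 'a table for', 'table for two']
```

A character-level extractor would instead produce fragments such as "boo" and "ook" from the same sentence, which is not what I need.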
How can I achieve that? Or will I need to implement my own component for word-level n-gram feature extraction? If so, does the pipeline below look correct?
- name: "WhitespaceTokenizer"
- name: "Tri-GramFeature" ## own component #rasa
- name: "CRFEntityExtractor"
- name: "EntitySynonymMapper"
- name: "CountVectorsFeaturizer"
- name: "EmbeddingIntentClassifier"