Hi, I found several forum posts advising people to use the following to make a pipeline case-insensitive:
- name: WhitespaceTokenizer
  case_sentitive: False
But when I try this, I get the following warning while training my model:
UserWarning: You have provided an invalid key `case_sentitive` for component `WhitespaceTokenizer` in your pipeline. Valid options for `WhitespaceTokenizer` are:
- intent_tokenization_flag
- token_pattern
- intent_split_symbol
Python version: 3.8.0
Rasa version: 2.2.0
My pipeline:
- name: WhitespaceTokenizer
  case_sentitive: False
- name: RegexFeaturizer
  case_sensitive: False
- name: LexicalSyntacticFeaturizer
- name: CountVectorsFeaturizer
- name: CountVectorsFeaturizer
  analyzer: char_wb
  min_ngram: 1
  max_ngram: 4
- name: DIETClassifier
  # entity_recognition: False
  epochs: 150
- name: CRFEntityExtractor
- name: EntitySynonymMapper
# - name: ResponseSelector
#   epochs: 100
- name: FallbackClassifier
  threshold: 0.5