Hi @Ghostvv. Thanks! I did some research on those and followed the advice from other posts about removing “title” like OP. This is what my pipeline looks like. Based on what I’ve read here, my model should be case insensitive, but it is not.
language: "en"
pipeline: #"supervised_embeddings"
- name: "WhitespaceTokenizer"
case_sensitive: false
- name: "RegexFeaturizer"
- name: "CRFEntityExtractor"
features: [
["low", "upper"],
["bias", "low", "prefix5", "prefix2", "suffix5", "suffix3",
"suffix2", "upper", "digit", "pattern"],
["low", "upper"]
]
- name: "EntitySynonymMapper"
- name: "CountVectorsFeaturizer"
oov_token: oov
- name: "EmbeddingIntentClassifier"
epochs: 50
intent_tokenization_flag: true
intent_split_symbol: "+"