Thanks for Rasa team’s wonderful work. I am a new Rasa user and learning some basic usages.
I’m building Chinese weather NLU models which includes city slot.
After training the models, some entities which in the lookup table but not in the examples such as
'西安' could be tagged as common.city entity, but some entities could not be tagged out, such as
'日照'. I am confused about this result. With the same context and same entity regex feature value, why the output is different for this two entities?
Maybe I miss something in the config for Chinese? Or I need to add more data?
Here is my data and config.
nlu: - intent: ask_weather examples: | - 查一下 [上海](common.city) 天气 - 查一下 [苏州](common.city) 天气 - 查一下 [无锡](common.city) 天气 - 查一下 [杭州](common.city) 天气 - [上海](common.city) 天气 ..... - lookup: common.city examples: | - 上海 - 北京 - 苏州 - 西安 - 广州 - 纽约 - 日照 ...
- name: JiebaTokenizer - name: RegexFeaturizer use_word_boundaries: False - name: CountVectorsFeaturizer - name: CountVectorsFeaturizer analyzer: "char_wb" min_ngram: 1 max_ngram: 4 - name: DIETClassifier epochs: 100 - name: EntitySynonymMapper