Does Lookup Tabel works in Chinese

sdu-2044 · June 27, 2019, 7:40am

I have tried to use Look up Table in my project .I follow the instruction of the demo in improvin-entity-extraction .It works well where I use nlp_spacy , intent_entity_featurizer_regex,and ner_crf.But in my case ,I need to deal with Chinese characters.So I change the pipline to `language: “zh”

pipeline:

name: “nlp_mitie” model: “total_word_feature_extractor_zh.dat”
name: “tokenizer_jieba”
name: “ner_synonyms”
name: “intent_entity_featurizer_regex”
name: “ner_crf”
name: “intent_featurizer_mitie”
name: “intent_classifier_sklearn”` And use it to train my Chinese Data. But the “Look up Table” seems does not work in this case.So any suggestion about my problem .

skypc785308 · July 1, 2019, 6:25am

I have tired to use Chinese in rasa with lookup table ,here is my pipeline language: “zh” pipeline:

name: “JiebaTokenizer” dictionary_path: “./jieba_userdict/dict.txt”
name: “RegexFeaturizer”
name: “CRFEntityExtractor”
name: “EntitySynonymMapper”
name: “CountVectorsFeaturizer”
name: “EmbeddingIntentClassifier”

sdu-2044 · July 1, 2019, 7:03am

Thanks for your reply .I have tried your pipline ,but it seems still can’t extract the entity in my lookup.txt.can you please send me your trainning data and dict.txtn by email daneljinsheng@gmail.com.or do you have a wechat ,my wechat id is sleeping__bear.Thank you very much

skypc785308 · July 1, 2019, 7:43am

jieba_userdict is unnecessary easy sample of data/nlu.md

## intent:inform_info
- [不存在的女兒](book_name)
- [撒哈拉歲月](book_name)
- [解答之書](book_name)

## lookup:book_name
  lookup_tables/book_name.txt

lookup sample lookup_tables/book_name.txt

白夜行
斜槓青年
挪威的森林
麥田捕手
解答之書
撒哈拉歲月
不存在的女兒

with pipline above,you can try to key 白夜行 ,rasa nlu will parse intent is inform_info and 白夜行 is a entity of book_name

sdu-2044 · July 2, 2019, 2:44am

Thanks for reply. You really help me a lot. It’s ok to run with your data . My training data was not tokenized well and I finally figure it out by impove my training data .I add some special words in my userDict and add more training data to help rasa recognize it .The problem is always the data not the pipeline .I was in the wrong direction.

Topic		Replies	Views
How can i use a lookup table por my entity? Rasa Open Source	4	840	October 10, 2022
Some entities in lookup could not be extracted for Chinese NLU Model Rasa Open Source	17	565	September 7, 2021
Lookup Table doesn't work Rasa Open Source	3	1726	June 6, 2019
Use of lookup table Getting Started with Rasa	11	1065	February 27, 2020
Lookup tables configuration problem Rasa Open Source	18	2311	June 25, 2024

Does Lookup Tabel works in Chinese

Related topics