How to get the Accuracy from MitieFeaturizer to DIETClassifier or CRFEntityExtractor

jingj5 · July 7, 2020, 7:05am

Hello, guys

I’m using rasa to train a model with chinese. And then I have a problem. it’s about rasa nlu pipeline. First I was using MitieEntityExtractor to extract entities. It’s works perfects. But the trainning time is too long. So I had change my extractor to DIETClassifier and CRFEntityExtractor. The problem is beginning.

I was asking a question to my bot: What’s the weather like today? The MitieEntityExtractor extracted “Today” entity. good one. But The DIETClassifier and CRFEntityExtractor extracted whole of the question “What’s the weather like today” both. How to improve this?

Please look below. The config.yml file between with these extractors is only extractor.

MitieEntityExtractor and extracted results

language: “zh”

pipeline:

name: “MitieNLP” model: “data/total_word_feature_extractor_zh.dat”

name: “JiebaTokenizer” dictionary_path: “data/dict”

name: “MitieEntityExtractor”

name: “EntitySynonymMapper”

name: “RegexFeaturizer”

name: “MitieFeaturizer”

name: “SklearnIntentClassifier”

policies:

name: KerasPolicy epochs: 500 max_history: 5

name: FallbackPolicy fallback_action_name: ‘action_default_fallback’

name: MemoizationPolicy max_history: 5

name: FormPolicy

怎么样今天的天气 { “intent”: { “name”: “request_weather”, “confidence”: 0.45935003402929125 }, “entities”: [ { “entity”: “date_time”, “value”: “今天”, “start”: 3, “end”: 5, “confidence”: null, “extractor”: “MitieEntityExtractor” } ] }

CRFEntityExtractor and extrated results

language: “zh”

pipeline:

name: “MitieNLP” model: “data/total_word_feature_extractor_zh.dat”

name: “JiebaTokenizer” dictionary_path: “data/dict”

name: “RegexFeaturizer”

name: “MitieFeaturizer”

name: “CRFEntityExtractor”

name: “EntitySynonymMapper”

name: “SklearnIntentClassifier”

policies:

name: KerasPolicy epochs: 500 max_history: 5

name: FallbackPolicy fallback_action_name: ‘action_default_fallback’

name: MemoizationPolicy max_history: 5

name: FormPolicy

怎么样今天的天气 { “intent”: { “name”: “request_weather”, “confidence”: 0.5135718909497647 }, “entities”: [ { “entity”: “date_time”, “start”: 0, “end”: 8, “confidence_entity”: 0.6655249585642125, “value”: “怎么样今天的天气”, “extractor”: “CRFEntityExtractor” } ],

DIETClassifier and extracted results

language: “zh”

pipeline:

name: “MitieNLP” model: “data/total_word_feature_extractor_zh.dat”

name: “JiebaTokenizer” dictionary_path: “data/dict”

name: “RegexFeaturizer”

name: “MitieFeaturizer”

name: “DIETClassifier” intent_classification: False

name: “EntitySynonymMapper”

name: “SklearnIntentClassifier”

policies:

name: KerasPolicy epochs: 500 max_history: 5

name: FallbackPolicy fallback_action_name: ‘action_default_fallback’

name: MemoizationPolicy max_history: 5

name: FormPolicy

怎么样今天的天气 { “intent”: { “name”: “request_weather”, “confidence”: 0.4984465527505863 }, “entities”: [ { “entity”: “date_time”, “start”: 0, “end”: 8, “value”: “怎么样今天的天气”, “extractor”: “DIETClassifier” } ],

This is the test result json file: CRFEntityExtractor_errors.json (12.5 KB)

Tanja · July 14, 2020, 1:22pm

I guess you are not using the latest version of Rasa. We had some issues extracting entities in Chinese. Can you try updating to the latest Rasa version? Does the error still occur?

Topic		Replies	Views
Mitie Entity Extraction Not working properly Rasa Open Source	4	805	December 2, 2021
Is it possible to use CRFEntityExtractor after DIET? Rasa Open Source	1	469	September 28, 2020
Using the CRFEntityExtractor with the DIETClassifier Rasa Open Source	16	5505	July 22, 2024
Using DIETClassifier for extracting entities makes no response Rasa Open Source	0	401	February 7, 2021
Use Huggingface model as entity extractor insted of DIETClassifier Rasa Open Source	2	135	May 14, 2024

How to get the Accuracy from MitieFeaturizer to DIETClassifier or CRFEntityExtractor

Related topics