Rasa_NLU ner_crf classification issue

Kenz · May 28, 2019, 2:19pm

Hi, I’m currently building a chatbot using Rasa-NLU and using ner_crf as entity classifier in the pipeline.

I’m having around half a million training sentences with only 12 different entities. The extraction is going well but the recognition is not that accurate…

I’m trying to find why…

The pipeline is as folow: language: “fr”

pipeline:

name: “components.preprocess.PrepareString”
name: “nlp_spacy”
name: “tokenizer_spacy”
name: “ner_crf” features: [[“low”], [“bias”, “suffix3”], [“upper”, “pos”, “pos2”]]
name: “ner_synonyms”
name: “intent_featurizer_count_vectors”
name: “intent_classifier_tensorflow_embedding”

I believe that it may be due to my none understanding of the features on ner_crf… Could someone explain to me what are the different features for ?

For example:

low
title
suffix5
suffix3
suffix2
suffix1
pos
pos2
prefix5
prefix2
bias
upper
digit

Ghostvv · June 12, 2019, 5:08pm

what do you mean by?

Topic		Replies	Views
Ner_crf Rasa Open Source	12	5123	September 28, 2018
Multiple NER Rasa Open Source	10	1325	May 24, 2019
Suggestion for pipeline Rasa Open Source	1	557	April 9, 2019
Rasa NLU 0.13.0 is released! Release Announcements	7	897	August 7, 2018
NLU 0.14.4 with tensorflow 1.12.0 unable to extract entities Rasa Open Source	2	622	March 18, 2019

Rasa_NLU ner_crf classification issue

Related topics