Failing at intent classification

I am training my model on the following data, but it keeps failing at intent classification. Can anyone suggest why this is happening and recommend better parameters for the intent classifier to improve detection?

| Intent | Number of examples |
|---|---|
| budget_details | 13 |
| goodbye | 24 |
| affirm | 29 |
| deny | 36 |
| greet | 42 |
| product_submission | 2776 |
| request_waiver | 2976 |
| customer_details | 3011 |
| product_details | 4913 |
| product_information | 26516 |

@akelad @JiteshGaikwad @Juste @JulianGerhard

Hi @shubham

this is a bit vague, to be honest. Could you please provide some more detailed information?

  1. Are you able to train the bot?
  2. Are you able to start the bot?
  3. Could you start the bot with the `-vv` flag and post the output here? (A sample command is sketched below.)
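
For reference, a minimal sketch of how that could look on the command line (assuming a standard Rasa 1.x project layout; adjust the paths to your setup):

```sh
# Minimal sketch, assuming a Rasa 1.x project (config.yml, data/, domain.yml)
rasa train          # train the NLU model and the Core policies
rasa shell -vv      # start the bot with debug-level logging and chat with it
```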

Besides, the data seems a bit unbalanced, where "a bit" means a lot: product_information has over 2,000 times as many examples as budget_details. Is your classifier working but you are just not satisfied with its performance? If so, please post your pipeline so that we can analyze the problem properly!
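
One way to check how much the imbalance hurts the small intents is to cross-validate the NLU data and look at the per-intent precision and recall. A sketch assuming the Rasa 1.x CLI, with illustrative file paths:

```sh
# Sketch: cross-validate the NLU data (Rasa 1.x; paths are illustrative)
rasa test nlu --nlu data/nlu.md --config config.yml --cross-validation
# Per-intent metrics are written to the results/ folder; very low recall on the
# small intents (e.g. budget_details with 13 examples) would point to the imbalance.
```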

Regards, Julian

Hey @shubham, may I see the confusion matrix produced by `rasa test nlu`? Essentially, I want to see how exactly your intent classifier fails.
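
For example (a sketch assuming Rasa 1.x defaults; the exact output filenames depend on your Rasa version):

```sh
# Sketch: evaluate the latest trained model against the NLU data (Rasa 1.x)
rasa test nlu --nlu data/nlu.md
# The evaluation output, including the intent confusion matrix image,
# is written to the results/ folder by default.
```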

```yaml
language: en

pipeline:
  - name: "SpacyNLP"
    model: "en_core_web_lg"
  - name: "SpacyTokenizer"
  - name: "SpacyFeaturizer"
  - name: "RegexFeaturizer"
  - name: DucklingHTTPExtractor
    url: http://localhost:8000
    dimensions:
      - time
      - amount-of-money
  - name: "CRFEntityExtractor"
    features: [
      ["low", "title", "pos", "pos2"],
      ["bias", "low", "prefix5", "prefix2", "suffix5", "digit", "suffix3", "suffix2", "upper", "title", "pattern"],
      ["low", "title", "upper", "pos", "pos2"]
    ]
    BILOU_flag: true
    max_iterations: 50
    L1_c: 0.1
    L2_c: 0.1
  - name: "EntitySynonymMapper"
  - name: CountVectorsFeaturizer
    OOV_token: OOV
    token_pattern: (?u)\b\w+\b
  - name: "EmbeddingIntentClassifier"
    hidden_layers_sizes_a: [256, 128]
    hidden_layers_sizes_b:
    batch_size: 10
    epochs: 300
    embed_dim: 20
    mu_pos: 0.8
    mu_neg: -0.4
    similarity_type: "cosine"
    num_neg: 20
    use_max_sim_neg: true
    random_seed: 50
    C2: 0.002
    C_emb: 0.9
    droprate: 0.2
    intent_tokenization_flag: true
    intent_split_symbol: "+"
    evaluate_every_num_epochs: 10
    evaluate_on_num_examples: 1000

policies:
  - name: MemoizationPolicy
  - name: KerasPolicy
  - name: MappingPolicy
  - name: FormPolicy
```

Hey @shubham, thanks for posting the pipeline, I think it will come in handy a bit later.

For now, could you please explain what is not working and, ideally, provide the kind of information (either error messages or the intent confusion matrix, etc.) that would help me or @JulianGerhard help you, as we explained earlier? Thanks.
