Hi all, I have a question about my training data. I need to build an NLU model with multiple intents, and since I have a large amount of training data I prefer this config:

```yaml
language: "en"
pipeline:
- name: "WhitespaceTokenizer"
- name: "RegexFeaturizer"
- name: "deepPavlov.DeepPavlov"
- name: "CRFEntityExtractor"
  features: [
    ["low", "title", "upper"],
    ["bias", "low", "prefix5", "prefix2", "suffix5", "suffix3", "suffix2",
     "upper", "title", "digit", "pattern"],
    ["low", "title", "upper"]
  ]
- name: "EntitySynonymMapper"
- name: "CountVectorsFeaturizer"
- name: "EmbeddingIntentClassifier"
- name: "DucklingHTTPExtractor"
  url: "http://rasa-support"
  timezone: "UTC"
  dimensions:
  - time
  - number
  - amount-of-money
  - distance
  - ordinal

policies:
- name: MemoizationPolicy
- name: KerasPolicy
- name: MappingPolicy
```
My question is this: I have multiple intents, and a few of them look very similar, differing only in a few keywords. Should I annotate those keywords as entities (filling slots) so the NLU can use them in recognition, or should I leave them as plain text? Which approach will give better accuracy and intent matching? Thanks in advance.
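To make the two options concrete, here is a minimal sketch in Rasa 1.x markdown training-data format. The intent names (`check_savings_balance`, `check_checking_balance`, `check_balance`) and the `account_type` entity are made-up examples for illustration, not from my actual data:

```md
<!-- Option A: keywords left as plain text, one intent per keyword.
     The intent classifier must separate near-identical sentences. -->
## intent:check_savings_balance
- what is the balance of my savings account
- show my savings balance

## intent:check_checking_balance
- what is the balance of my checking account
- show my checking balance

<!-- Option B: one merged intent, with the keyword annotated as an
     entity so the entity extractor handles the distinction. -->
## intent:check_balance
- what is the balance of my [savings](account_type) account
- what is the balance of my [checking](account_type) account
- show my [savings](account_type) balance
```

In Option B the differing keyword would be picked up by `CRFEntityExtractor` and could be stored in a slot, whereas in Option A the intent classifier has to separate near-identical sentences on its own. I am unsure which setup works better in practice.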