Training Rasa NLU model on AWS EC2 p2.xlarge Instance

Hi, it takes 2 days to train the NLU model on my local computer. Can I use p2.xlarge instances to reduce the training time? Does Rasa NLU support p2.xlarge? Are there any limitations?

My first question is: how much data do you have that it takes two days to train? My guess is that you synthetically generated this data with a script or a tool like Chatito. That's a bad idea; it's much cleaner to use a lookup table if you have a large number of predefined values.
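(For reference, a lookup table in the Rasa 1.x markdown training data format looks roughly like this; the `city` entity and its values here are purely illustrative:)

```md
## lookup:city
- London
- Paris
- Berlin
```

You can also point the lookup at an external newline-separated file instead of listing values inline, e.g. a path like `data/lookup_tables/cities.txt`.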

Hi @amn41 Thanks for your reply.

The Rasa NLU dataset I use contains 411 intents with around 70k utterances. This data was manually generated.

Wow, OK, that's a lot of annotated data. I would recommend the supervised embeddings pipeline, which uses TensorFlow and should scale better to this size. I'm not sure how many epochs you have set, but with that much data I suspect you can reduce it to a very small number without losing performance.
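(To make the epochs suggestion concrete: in Rasa 1.x you can override the number of epochs by writing out the `supervised_embeddings` pipeline explicitly and setting `epochs` on the `EmbeddingIntentClassifier`, which defaults to 300. The value below is just something to experiment with, not a recommendation:)

```yaml
language: en
pipeline:
- name: "WhitespaceTokenizer"
- name: "RegexFeaturizer"
- name: "CRFEntityExtractor"
- name: "EntitySynonymMapper"
- name: "CountVectorsFeaturizer"
- name: "CountVectorsFeaturizer"
  analyzer: "char_wb"
  min_ngram: 1
  max_ngram: 4
- name: "EmbeddingIntentClassifier"
  epochs: 50  # default is 300; try smaller values with large datasets
```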

Yes, I use supervised embeddings, currently with the default number of epochs. I will try changing it and see. Thanks a lot for replying :slight_smile:

Hi @amn41, can you please confirm whether training the NLU model on GPU instances will reduce the training time?
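(If you want to check whether TensorFlow is actually picking up the GPU before benchmarking, a quick sanity check like the following should work with the TF 1.x versions that Rasa 1.x depends on; a sketch, not an official recipe:)

```python
# Sanity check that TensorFlow (1.x API) can see a CUDA GPU.
import tensorflow as tf

print(tf.test.is_gpu_available())  # True if a usable CUDA GPU was found
print(tf.test.gpu_device_name())   # e.g. '/device:GPU:0' on a p2.xlarge (Tesla K80)
```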

Hi Bharat, I am also facing the same issue. Can you please share any findings you have on reducing the training time? Thanks, Prashant

Hi guys. I'm also facing the same issue. I'd love to hear any tips you have on reducing the training time.

Hi @dgslv, do you also have such a large amount of training data? I have to warn you that synthetically generating a lot of training data with a script is a bad idea.

Hi @amn41! I have 41k manually annotated phrases in my training data; they are not synthetically generated.

Wow, that's awesome! What annotation tool do you use?