I’ve been trying to train a rasa_core model on about 150k stories, but I keep running out of memory. Even on a machine with 400GB of RAM, training still exhausts memory.
My question is: shouldn’t the batch_size determine how much data is loaded into RAM at a time? Why am I hitting this problem even with a small batch_size? It looks like rasa_core was implemented to load all data into memory at once, but hasn’t anyone tried training with a large story dataset yet?
Hey @akari. By default Rasa does data augmentation: it takes the stories in your training data file and creates additional training examples from them. Do you have the augmentation parameter set in your policy configuration? If not, can you try training the bot without augmentation by setting the flag --augmentation 0? Let me know if the issue persists with augmentation 0.
Hi @Juste, thanks for answering. Yes, I forgot to mention it, but I’m already using augmentation 0.
But in any case, even with a huge augmentation factor, shouldn’t rasa_core be able to handle the generated data by loading it into memory in batches?
I’m from @akari’s team and we found the issue.
We have a custom training script that passes augmentation_factor=0, but we were passing this parameter to Agent.train() instead of Agent.load_data().
We didn’t get any warning message because train() receives its parameters via **kwargs, so the misplaced argument was silently ignored.
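For anyone hitting the same thing, here is a minimal sketch of the difference (exact signatures may vary across rasa_core versions; the domain/story paths and the policy are just placeholders):

```python
from rasa_core.agent import Agent
from rasa_core.policies.memoization import MemoizationPolicy

agent = Agent("domain.yml", policies=[MemoizationPolicy()])

# What we were doing: train() collects extra arguments via **kwargs,
# so the misplaced augmentation_factor was silently ignored and the
# default augmentation still ran.
training_data = agent.load_data("data/stories.md")
agent.train(training_data, augmentation_factor=0)  # no effect, no warning

# What we should have done: augmentation happens when the stories are
# loaded, so the factor belongs in load_data().
training_data = agent.load_data("data/stories.md", augmentation_factor=0)
agent.train(training_data)
```

With the factor passed to load_data(), the stories are no longer augmented and memory usage drops accordingly.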
What do you think about adding a warning message for unused parameters, to avoid similar issues in the future?
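Something like the sketch below (purely hypothetical, not actual rasa_core code; the parameter names are made up for illustration) would have surfaced our mistake immediately:

```python
import warnings

# Hypothetical set of parameters that train() actually forwards to the
# policies; anything else is presumably a mistake by the caller.
KNOWN_TRAIN_PARAMS = {"epochs", "batch_size", "validation_split"}

def warn_on_unused_kwargs(**kwargs):
    """Warn about keyword arguments that no policy will ever consume."""
    unused = set(kwargs) - KNOWN_TRAIN_PARAMS
    if unused:
        warnings.warn(
            "Ignoring unknown training parameters {}; did you mean to "
            "pass them to load_data()?".format(sorted(unused))
        )

warn_on_unused_kwargs(augmentation_factor=0)
# UserWarning: Ignoring unknown training parameters ['augmentation_factor']; ...
```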