Preparing data for training - single/multi threaded

notrickyd · February 1, 2022, 2:55pm

Hi! Please, give some hints about what is done inside a process that goes after you run train command and you get console output:

Processed story block […] 5 it/s

? It turns out, that each step like above is done within minutes for 1000+ stories. And this process uses only one CPU on a 80 CPUs server.

Yes, I’m aware of solution Multi-thread training and I do understand, that preparation of data goes in a single thread, but tensor-flow tasks run in multiple threads by default.

I just want to understand what is done while preparing data and is it possible in any case speed it up by forcing to use multiple CPUs/threads.

notrickyd · February 8, 2022, 3:39pm

Hi! A week passed. Any tips are still appreciated!

notrickyd · March 30, 2022, 3:37pm

Any updates?

Topic		Replies	Views
More information on rasa core model training process Rasa Open Source	9	1845	August 17, 2018
Multi-thread training Rasa Open Source	2	993	June 2, 2020
Story Blocks never process before Interactive Training with many Checkpoints Rasa Open Source	2	540	March 26, 2022
Has anyone noticed faster training times going from core 0.13.8 to 0.14.5? Rasa Open Source	1	939	June 19, 2019
Story Loading on multiple cores Rasa Open Source	15	1938	April 16, 2019

Preparing data for training - single/multi threaded

Related topics