How to train CRFEntityExtractor faster?

pankti23 · December 20, 2019, 10:26am

I have around 3000+ custom entities in my project. This exponentially increases the training data and rasa nlu trianing is taking upto two days especially the CRFEntityExtractor component, is there a way to multithread this component or improve training speed?

erohmensing · December 20, 2019, 2:20pm

HI @pankti23, what version of rasa are you running? Have you tried playing around with any of the configuration values on the component or the featurizers?

JulianGerhard · December 20, 2019, 2:43pm

Hi @pankti23,

which pipeline components are you using currently?

If you are using spaCy, I’d recommend to split the training process by re-training the underlying spaCy model with your custom entities and then let Rasa simply auto-fill them by specifying them in your config.yml.

Aside @erohmensing 's suggestion to play a little bit with the possible parameters, this at least would avoid retraining your model everytime this way.

Regards Julian

pankti23 · December 20, 2019, 3:40pm

@erohmensing I am using the 1.4.3 version, and I did try tweaking the parameters however it still took a while. I have currently started using the tensorflow gpu for the training. Thank you, closing the issue for now. @JulianGerhard thank you, looks like that should work the next time

Topic		Replies	Views
CRFEntityExtractor how much time take to complete Welcome to the Rasa Community Forum!	5	874	December 31, 2019
Entity extraction training time exploded using roles for entitites Rasa Open Source	6	573	June 4, 2020
All core usage in RASA Rasa Open Source	4	811	November 13, 2020
Entity tagging for large datasets Rasa Open Source	11	2814	May 17, 2019
Rasa Extracting unsupervised entities Rasa Open Source	5	1086	August 23, 2019

How to train CRFEntityExtractor faster?

Related topics