Does the DIET architecture work effectively with larger datasets?

Hi, I was training an NLU model on a dataset consisting of 60k utterances across 400+ intents. I was training with the DIET architecture on a p2.16xlarge AWS instance, using the diet-heavy.yml config provided here (DIET Benchmarks · GitHub), and started the training with the command below.

rasa test nlu --config configs/diet-heavy.yml --cross-validation --runs 1 --folds 2 --out results/diet-heavy

After running this command, I observed that the epoch progress is stuck at zero and doesn't advance for a long time.

Is the DIET architecture in Rasa compatible with larger datasets?

If possible, could you help me figure out my mistake?

Thank you.

Could you share your pipeline configuration as well as the output that you see from the terminal? You should still see something of a progress bar even if it is slow.

Also, did you run the lightweight variants as well before running the heavy one?
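
In case it helps, a lightweight variant is essentially sparse features feeding DIET directly, with no pretrained language model in front of it. A minimal sketch, assuming Rasa 2.x component names (the epoch count is an illustrative smoke-test value, not the benchmark's actual setting):

language: en
pipeline:
  - name: WhitespaceTokenizer
  - name: CountVectorsFeaturizer
  - name: CountVectorsFeaturizer   # char n-grams add robustness to typos
    analyzer: char_wb
    min_ngram: 1
    max_ngram: 4
  - name: DIETClassifier
    epochs: 25                     # keep low at first to confirm training progresses

If a config like this trains fine on your 60k utterances, the bottleneck is more likely the featurization step than DIET itself.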

The heavy settings that you are using involve more than just DIET. You're also running BERT, and that can certainly be heavy in production. DIET is designed to handle larger datasets too, but I would argue that 400+ intents is a lot. On the intents … just to check … what kind of use case do you have here? Frequently asked questions?
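
To make the contrast concrete: the heavy setup pairs those same sparse features with dense BERT embeddings, roughly along these lines (again only a sketch assuming Rasa 2.x components; the actual diet-heavy.yml in the benchmark repo may differ in its details):

language: en
pipeline:
  - name: WhitespaceTokenizer
  - name: LanguageModelFeaturizer  # loads a pretrained BERT model
    model_name: bert
    model_weights: bert-base-uncased
  - name: CountVectorsFeaturizer
  - name: CountVectorsFeaturizer
    analyzer: char_wb
    min_ngram: 1
    max_ngram: 4
  - name: DIETClassifier
    epochs: 200

Note that featurizing all 60k utterances with BERT happens before the first epoch starts, so the progress bar can sit at zero for quite a while even when nothing is wrong.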