Hi, I was training an NLU model on a dataset of 60k utterances with 400+ intents. I was training it with the DIET architecture on a p2.16xlarge AWS instance, using the diet-heavy.yml config provided here (DIET Benchmarks · GitHub), and I used the command below to start the training.
rasa test nlu --config configs/diet-heavy.yml --cross-validation --runs 1 --folds 2 --out results/diet-heavy
After running this command, I observed that the epoch progress stays stuck at zero and does not advance for a long time.
Is the DIET architecture in Rasa compatible with larger datasets?
If it is, can you help me figure out my mistake?
Thank you.